
REO'Q 2 2 JMN 2E006 



P2 1320744 



UWIUliiiUWil.Mimi.nllm.MIIIU 




June 17, 2005 



UNITED STATES DEPARTMENT OF COMMERCE 
United States Patent and Trademark Office 



THIS IS TO CERTIFY THAT ANNEXED HERETO IS A TRUE COPY OF 
THE BELOW IDENTIFIED INTERNATIONAL APPLICATION AS 
ORIGINALLY FILED AND ANY CORRECTIONS THERETO FROM THE 
RECORDS OF THE UNITED STATES PATENT AND TRADEMARK 
OFFICE ACTING AS A RECEIVING OFFICE UNDER THE PATENT 
COOPERATION TREATY. 

APPLICATION NUMBER: PCT/US04/06308 
FILING DATE: March 02, 2004 



By Authority of the 

COMMISSIONER OF PATENTS AND TRADEMARKS 




H. L! 
Certifying'Officer 



V 



W / r 



P G T/ U S 01 HI-/ O 6 3 O B 



DT02Rec f dPCT/PTO 0 2 MAR 2004 



TRANSMITTAL LETTER TO THE 
UNITED STATES RECEIVING OFFICE 



K Certificate under 37 CFR 1.10 (if applicable) 



Date 



2 March 2004 



International Applical 



Attorney Docket No. 



_| PKOL-PWD-024 



EV323 524 199US 



Express Mail mailing number 



2 March 2004 



Date of Deposit 



tender 

coiTespondence " 



Signature of person mailing correspond* 



~x] New International Application 




Typed or printed name of person maflinVcom 



TITLE 



POSH INTERACTING PROTEINS AND RELATED 
METHODS 



Eariiest priority date 
(Day/Mooth/Year) 



03/03/03 



SCREEN DISCLOSURE INFORMATION: In order to assist in screening the accompanying inter national application for 
purposes of determining for foreign transmittal should and could be granted and foroEuS 

information is supplied. (Note: check as many boxes as apply): purposes, tne toiiowmg 

A - The invention disclosed was not made in the United States. 

B - There is no prior U.S. application relating to this invention. 

C [x] The following prior U.s. applications) contain subject matter which is related to the invention disclosed in the attached 
and this listing does not constitute a claim for priority.) ■ ? 



AppNo 
App No 



AppNo 
App No 



60/451 ,437 filed 3 March 2003 



60/452,284 filed 5 March 2003 



AppNo 



AppNo 



AppNo 



60/475,825 filed 3 June 2003 



60/479,317 filed 17 June 2003 



60/480,376 filed 19 June 2003 



60/456,640 filed 20 March 2003 



AppNo 



60/480,215 filed 19 June 2003 



AppNo 
AppNo 



60/460,526 filed 3 April 2003 



App No 



60/493,860 filed 8 August 2003 



AppNo 
AppNo 



60/464,285 filed 21 April 2003 



App No 



60/469,462 filed 9 May 2003 
60/471,378 filed 15 May 2003 



AppNo 



60/503,931 filed 16 September 2003 



60/455,760 filed 19 March 2003 



AppNo 



AppNo 
AppNo 



60/472, 327 filed 20 May 2003 



AppNo 



60/460,792 filed 4 April 2003 



60/498,634 filed 28 August 2003 



AppNo 



60/474,706 filed 30 May 2003 



App No 



A PCT application filed on Februaiy 5, 2004 
(Attorney Docket No. PROL-PWO-039), in the name 
of Iris Alroy, Daniel Taglicht, Yaval Reiss, Uora 
Yaar, and Shmuel Tuvia entitlted "Posh Associated 
Kinases and Related Methods." 



App No 



US03/35712 filed 10 November 2003 



A provisional application filed on March 2, 2004, (Attorney Docket 
No. PROL-P79-024), in the name of Daniel N. Taglicht, Iris Alroy, 
Yuval Reiss, Liora Yaar, Danny Ben-Avraham, Shmuel Tuvia and 
Tsvika Greener entitled "Posh Interacting Proteins and Related 
Methods.** 



D. [7] The present international application [7] contains additional subject matter not found in the prior US. application^) 

idetlttfiexi in namaranU n a k«.. TU JJii! i i . . . _ I ~ — — — — 



identified in paragraph C. above; The additional subject matter is found on pages 



□ ■ ■»"•- -f^Z -w»jvw uioucf is iouna on pages | THROUGHOUT 
DOES NOT ALTER |x] MIGHT BE CONSIDERED TO ALTER the general nature of the invention in i 



IV. Q A Request for Rectification under PCT Rule 91 [] A Petition 



A Sequence Listing Diskette 



V. |X] Otherfrleasespeci*, 



The person 
signing this 
form is the: 



L 1 A PPHcant . 


Kathleen Ehrhard 


| X | Attorney/Agent Reg. No. 
P-55 t 144 


_ — - Typed name of signer 


( J Common Representative 


Signature 
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REQUEST 

The undersigned requests that the present 

international application be processed 
according to the Patent Cooperation Treaty. 



International Applii 



For receiving Office use only i— 

PCI/US OA/06308 



International Filing Date 



E 



emational Application" 



mm 



Of desired) (12 characters maximum) 



PROL-PWO-024 



Box No. I TITLE OF INVENTION 

POSH INTERACTING PROTEINS AND RELATED METHODS 



| | This person is also inventor 



Box No. n APPLICANT __ _____ 

Name and address: (Family name followed by given name; for a legal entity. fW official oteg^on. 
Vie address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is, country) of residence if no State of residence is indicated below.) 

PROTEOLOGICS, INC. 
40 Ramland .Road South 
Suite 10 

Orangeburg, New York 10962 
United States of America 



Telephone No. 



Facsimile No. 



Teleprinter No. 



Applicant's registration No. with the Office 



-State (that is, country) of nationality: 

US 



State (that is, country) of residence: 
US 



This person is applicant Fl all designated Fn all designated States except I | the United States 
fftfSS^rf! U States Li] the United States of America L_l ofAmencaonly 



□ the States indicated in 
the Supplemental Box 



Box No. Ill F URTHER APPLICANT(S) AND/OR (FURTHER) INVENT OR(S) 

Name and address: (Family name followed by given name; for a legal entity, fall i^cM de^a^ 
Vre address must include postal code and name of country. The country of the adaVess indicated m this 
Box is the applicant's State (that is, country) of residence if no State of residence is indicated below.) 



TAGLICHT, Daniel N. 

Lapid 

Israel 



State (that is, country) of nationality: 

IL 



This person is: 
| | applicant only 
| x | applicant and inventor 

□ inventor onry (If this check-box 
is marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of residence: 
IL 



This oerson is applicant I 1 all designated I 1 all designated States except f—\ the United States 

forfc^^egg U States LJ the United S tates of America LiU of America only 



□ the States indicated in 
the Supplemental Box 



jTJ Further applicants and/or (further) inventors are indicated on a continuation sheet 
Box No. IV 



AGENT OR COMMON REPRESENTATIVE; OR ADDRESS FOR CORRESPONDENCE 



The person identified below is hereby/has been appointed to act on behalf 
of the applicants) before the competent International Authorities as: 



agent 



□ 



common 
representative 



Name and address: (Family name follow by given name; for a legal entity, fall official designation. 
The address must include postal code and name of country.) 

EHRHARD, Kathleen 
Ropes & Gray LLP 
One International Place 
Boston, Massachusetts 02110-2624 
United States of America 



Telephone No. 

(617) 951-7037 



Facsimile No. 

(617) 951-7050 



Teleprinter No. 



Agent's registration No. with the Office 
P-55,144 



□ Address for correspondence: Mark this check-box where no agent or common representative is/has been appointed and the 
space above is used instead to indicate a special address to which correspondence should be sent 



See Notes to the request form 
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Continuation of Box No. in FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTORY) 

If none of the following sub-boxes is used, this sheet should not be included in the request 



Name and address: (Family name followed by given name; for a w - - , ,.., Uie 
The address must include postal code and name of country. The country f the aMress indicated m this 
Box is the applicant's State (thai is, country) of residence if no State of residence is indicated below.) 

ALROY, Iris 
Hashirion Street 10/17 
74065 Nes Ziona 
Israel 



This person e: 
| | applicant only 

[~x] applicant and inventor 

□ inventor only (If this check-box is 
marked, do not fill in below,) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

IL 



State (that is, country) of residence: 
IL 



This person is applicant I 1 all designated I 1 all designated States except 

for the purposes of: LJ States l_l the United States of America 



H the United States 
of America only 



□ 



the States indicated in 
the Supplemental Box 



Name and address: (Family name followed by given name; for a legal entity, Jull official desigmtion. 
The address must include postal code and name of country. The ^ountry ofthe aa^n^ed in this 
Box is the applicants State (that is, country) of residence if no State of residence is indicated below.) 

REISS, Yuval 
Hahavazelet 11/6 
Kiriat-ono 
Israel 



This person is: 
| I applicant only 

[ x] applicant and inventor 

□ inventor only (If this check-box is 
marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

IL 



State (that is, country) of residence: 
IL 



This person is applicant I | all designated j | 
for the purposes of: I 1 States I 1 



all designated States except 
the United States of America 



the United States 
of America only 



□ 



the States indicated in 
the Supplemental Box 



Name and address: (Family name followed by given name; far a legal entity. &l °ffi«* <%^°* 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is. country) of residence if no State of residence is indicated below.) 

YAAR, Liora 
8 Kalisher Street 
43354 Raanana 
Israel 



This person is: 
| | applicant only 

[ xj applicant and inventor 

□ inventor only (If this check-box is 
marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

IL 



State (that is, country) of residence: 
IL 



This person is applicant I I all designated I 1 alldesij 
1 | States I I theUni 



for the purposes of: 



the United States of America 



x 



the United States 
of America only 



□ 



the States indicated in 
the Supplemental Box 



Name and address: (Family name followed by given ^l^ al ^^'^^^^Z, 
The address must Include postal code and name of country. 



BEN-AVRAHAM, Danny 
Igal Alon 20 
Zichron Jackov 
Israel 



This person is: 
| [ applicant only 

| x| applicant and inventor 

□ inventor only (If this check-box is 
marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

IL 



State (that is, country) of residence: 
IL 



This person is applicant [ I all designated I I 
I I States I I 



for the purposes of: 



I designated States except Qjj the United States 



the United States of America 



of America only 



□ the States indicated in 
the Supplemental Box 



Jjcj Further applicants and/or (further) inventors are indicated on, another continuation sheet 



Form PCT/RO/101 (continuation sheet) (January 2004) 
9382278.1 



See Notes to the request form 
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Continuation of Box No. HI FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTORY) 



If nam of the following sub-boxes is used, this sheet should not be included in the request 



Name and address: (Family name followed by given namejbr a legal ^'^^^ff 1 ^ 
^address must include postal code and name of county. ^'c^^f^^T 
BoxltoeapplicantsSt^ 



TUVIA, Shmuel 
Hartzit 1 
42490 Netanya 
Israel 



This person is: 
| ~J applicant only 

| x| applicant and inventor 

□ inventor only (If this check-box is 
marked, do not fill in below,) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

IL 



State (that is, country) of residence: 
IL 



This person is applicant I 1 all designated I | an designated States except 

for the purposes of: l_J States LJ the United States of Amenca 



the United States 
of America only 



□ the States indicated in 
the Supplemental Box 



Name and address: (Family name followed by given name; for alegpl ^^^^TZ 
The address must include postal code and name of country. The <^J^!^J^^ 
Baxistheapplicant'sState (that is, country) of residence if no State of residence is indicated below) 

GREENER, Tsvika 
Hahavazelet 9a 
Ness Ziona 
Israel 



This person is: 
[ J applicant only 

j x| applicant and inventor 

□ inventor only (If this check-box is 
marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

IL 



State (that is, country) of residence: 
IL 



This person is applicant I 1 all designated ( I all designated States except 

forme purposes [of: U States LJ the United States of Amenca 



Hthe United States 
of America only 



□ the States indicated in 
the Supplemental Box 



Name and address: (Faimlynamefollowedbygtvmname;fora 
ThTaddress must include postal code and name of country. The country of the address indicated in this 
country) of residence if no State of residence is indicated below.) 



This person is: 
F | applicant only 

| | applicant and inventor 



□ 



inventor only (If this check-box is 
marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 



State (that is, country) of residence: 



This person is applicant | 1 all designated I | all designated States except 

for ^purposes of: U States l_J the United States of Amenca 



□ the United States 
of America only 



□ the States indicated in 
the Supplemental Box 



"Name and address: (Family name followed by given name; for a legal entry, ^if c ^^^ h l 
Theaddress must include postal code and name of country. The ^^jf^ff^^f^ ^ 
Baxistheapplicant'sState (that is, country) of residence if no State of residence is indicated below.) 



This person is: 
| ) applicant only 



| | applicant and inventor 

□ 



inventor only (If this check-box is 
marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 



State (that is, country) of residence: 



This person is applicant I 1 all designated | | all designated States except 

for the purposes of: I 1 States 



□ all designated States except | I the United States 
the United States of America I 1 of America only 



□ the States indicated in 
the Supplemental Box 



[~"~] Further applicants and/or (further) inventors are indicated on another continuation sheet 



Form PCT/RO/101 (continuation sheet) (January 2004) 
9382278.1 



See Notes to the request form 
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Supplemental Box If the Supplemental Box Is not used, this sheet should not be included in the request. 



Ap 
pli 
cat 
ion 
No 



00 



If in any of the Boxes except Boxes Nos. VJIIfi) to (v) for which 
a special continuation box is provided, the space is insufficient to 
furnish all the information: in such case, vrrite "Continuation of 
box No. .... " (indicate the number of the Box) and furnish the 
information in the same manner as required according to the 
captions of the Box in which the space was insufficient, in 
particular: 

if more than two persons are to be indicated as applicants 
and/or inventors and no "continuation sheet" Isavallable: in 
such case, write "Continuation of Box No. Ill" and indicate for 
each additional person the same type of information as required 
in Box No. III. The country of the address indicated in this Box 
is the applicant's State (that is, country) of residence \f no State 
of residence is indicated below: 

if, in Box No. II or in any of the sub-boxes of Box No. Ill, the 
indication u the States indicated in the Supplemental Box" is 
checked: in such case, write "Continuation of Box No. II" or 
"Continuation of Box No. Ill" or "Continuation of Boxes No. II 
and No. Ill" (as the case may be), indicate the name of the 
applicants) involved and, next to (each) such name, the State(s) 
(and/or, where applicable, ARIPO, Eurasian, European or 
OAPI patent) for the purposes of which the named person is 



Continuation of Box No. VI 



2. 



3. 



if, in Box No. II or in any of the sub-boxes of Box No. Ill, the 
inventor or the inventor/applicant is not inventor for the 
purposes of all designated States or for the purposes of the 
United States of America: in such case, write "Continuation of 
Box No. II" or "Continuation of Box No. Ill" or "Continuation 
of Boxes No. 11 and No. Ill" (as the case may be), indicate the 
name of the inventor (s) and, next to (each) such name, the State (s) 
(and/or, where applicable, ARIPO, Eurasian, European or 
OAPI patent) for the purposes of which the named person is 
inventor; 

if, in addition to the agent (s) indicated in Box No. IV, there are 
further agents: in such case, write "Continuation of 
Box No. IV" and indicate for each further agent the same type 
of information as required in Box IV; 

if, in Box No. VI, there are more than three earlier applications 
whose priority is claimed: in such case, write "Continuation of 
Box No. VI" and indicate for each additional earlier application 
the same type of information as required in Box No. VI 

If the applicant intends to make an indication of the wish that 
the international application be treated, in certain designated 
States, as an application for a patent of addition, certificate of 
addition, inventor's certificate of addition or utility certificate 
of addition: in such a case, write the name or two-letter code 
ofeacli designated States concerned and the indication "patent 
of addition, " "certificate of addition, " "inventor's certificate 
of addition * or M utility certificate of addition, " the number of 
the parent application or parent patent or other parent grant 
and the date of grant of the parent patent or other patent grant 
or the date of filing of the parent application (Rules 4.ll(a)(iiQ 
and49bis.l(a)or(b)). 

i 

If the applicant intends to make an indication of the wish that the 
international application be treated, in Out United States of 
America, as a continuation or continuation-in-part of an earlier 
application: in such a case, write "United States of America" 
or "US" and the indication "continuation" or "continuation- 
in-part" and the ntanber and the filing date of tlte parent 
application (Rules 4.ll(a)(tv) and 49bis. 1(d)). 



Application; 60/452284 



Application: 60/456640 
Application: 60/460526 
Application: 60/464285 



Application: 
Application; 



60/471370 
60/472327 
60/474706 



Application: 60/475825 
Application: 60/479317 
Application: 60/480215 
Application: 60/493860 



Application: 



(4) Date: 05 March 2003 (05/03/03) 
National Application Country: US 

(5) Date: 20 March 2003 (20/03/03) 
National Application Country: US 

(6) Date: 03 April 2003 (03/04/03) 
National Application Country: US 

(7) Date: 21 April 2003 (21/04/03) 
National Application Country: US 

(8) Date: 15 May 2003 (15/05/03) Application: 
National Application Country: US 

(9) Date: 20 May 2003 (20/05/03) 
National Application Country: US 

(10) Date: 30 May 2003 (30/05/03) 
National Application Country: US 

(11) Date: 03 June 2003 (03/06/03) 
National Application Country: US 

(12) Date: 17 June 2003 (17/06/03) 
National Application Country: US 

(13) Date: 19 June 2003 (19/06/03) 
National Application Country: US 

(14) Date: 08 August 2003 (08/08/03) 
National Application Country: US 

(15) Date: 16 September 2003 (16/09/03) 
60/503931 

National Ajapjication Country: US 

(16) Date: oJrMarch 2004 (02/03/04) Application: Aprowsknal 
application fifed en March 2, 2004, (Attorney Docket No. PROLrP79-024X in the name of 
Daniel KTaglicht, Iris Alroy, Yuval Ross, Lkxa Yaar, Danny Ben-Avraham, Shmucl Tuvia and 
TsvikaGreaierenrilled"^ 

National Application Country: US 

(17) Date: 03 March 2003 (03/03/03) 
60/451,437 

National Application Country: US 

(18) Date: 09 May 2003 (09/05/03) 
60/469,462 

National Application Country: US 

(19) Date: 19 June 2003 (19/06/03) 
60/480,376 

National Application Country: US 

(20) Date: 10 November 2003 (10/11/03) 
US03135712 

National Application Country: US 

(21) Date: PCT filed 05 February 2004 (Attorney Docket No. 
PROL-PWO-039) in the name of Iris Alroy, Daniel Taglicht, 
Yuval Reiss, Liora Yaar, and Shmuel Tuvia entitled "Posh 
Associated Kinases and Related Methods" 

Continuation of Box No. IV: 

Steven Baglio, 51,426; J. Steven Baughman, 47,414; Mark W. 
Belloray, 51,452; John V. Bianco, 36,748; Johnny Y. Chen, 
46,614; James P. Deroers, 34,320; Gojeb L. Frehywot, 52,916; 
Gloria Fuentes, 47,580; Gregory Glover, 34,173; William G. 
Gosz, 27,787; Patricia Granahan, 32,227; Z. Angela Guo, 
54,144; David P. Halstead, 44,735; Margaret E. Jamroz, 
54,196; Edward J. Kelly, 38,936; Charles Larsen, 48,533; 
Agnes S. Lee, 46,862; Paul E. Lewkowicz, 44,870; Weishi Li, 
53,217; Yu Lu, 50,306; Alexander Manganiello, 53,264; Robert 
A. Mazzarese, 42,852; Christopher Natkanski, 50,365; R. 
Daniel O'Connor, P54,343; Ignacio Perez de la Cruz, 55,535; 
Melissa S. Rones, Ph.D., 54,408; Spencer H. Schneider, 
45,923; Sanjay Sitlani, 48,489; Wolfgang E. Stutius, 40,256; 
Erika Takeuchi, 55,661; Lisa Treannie, 41,368; Anita Varna, 
43,221; Matthew P. Vincent, 36,709; Dalila Argaez Wendlandt, 
52,351; and Levina Wong, P54,551 

And all other agents of: 
ROPES & GRAY LLP, Patent Group 
One International Place 
Boston, Massachusetts 02110-2624 
United States of America 
Customer ID No: 28,120 



Application: 



Application: 



Application: 



Application: 



Form PCT/RO/101 (January 2004) 
9382278.1 
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Box No. V DESIGNATIONS . . 

The filing of this request constitutes under Rule 4.9(a), the designation of all Contracting States bound by the PCT on the international 
filing date, for the grant of every kind of protection available and, where applicable, for the grant of both regional and national patents. 

H owev er, 

[ | DE Germany is not designated for any kind of national protection. 

□ 
□ 



KR Republic of Korea is not designated for any kind of national protection. 
RU Russian Federation is not designated for any kind of national protection. 



(The check-boxes above may be used to exclude (irrevocably) the designations concerned in order to avoid the ceasing of the effect, under 
the national Imv, of an earlier national application from which priority is claimed See the Notes to Box No. V as to the consequences of 
such national law provisions in these and certain other States.) m , 



Box No, VI PRIORITY CLAIM 



The priority of the following earlier application(s) is hereby claimed: 



Filing date 
of earlier application 
. (day/month/year) 


Number 
of earlier application 


Where earlier application is: 


national application: 
country or Member 
of WTO 


regional application:* 
regional Office 


international application 
receiving Office 


item(l) 


19 March 2003 
(19.03.2003) 


60/455760 


US 






item (2) 


28 August 2003 
(28.08.2003) 


60/498634 


US 






♦ item (3) 


04 April 2003 
(04.04.2003) 


60/460792 


us 







| x| Further priority claims are indicated in the Supplemental Box. 



The receiving Office is requested to prepare and transmit to the International Bureau a certified copy of the earlier application(s) (only if 
the earlier application was filed with the Office which for the purposes of this international application is the receiving Office) identified 
above as: 

^allitems Q item(l) \J item (2) Q item (3) Q other, see Supplemental Box 

* mere the earlier application is anARlPO application, indicate at least one country party to the Paris Convention for the Protection of 
Industrial Property or one Member of the World Trade Organization for which that earlier application was filed (Rule 4JO(b)0i)): 



Box No. VII INTERNATIONAL SEARCHING AUTHORITY 



Choice of International Searching Authority (ISA) (if two or more International Searching Authorities are competent to carry out Hie 
international search, indicate the Authority chosen; the two-letter code maybe used): 

ISA /US 

Request to use results of ea rlier search; reference to that search Of an earlier search has been carried out by or requested from the 
International Searching Authority): 

Date (day/montlifyear) Number Country (or regional Office) 



BoxNo.Vm DECLARATIONS 



The following declarations are contained in Boxes Nos. VIII (i) to (v) (mark the applicable Number of 

check-boxes below and indicate in the right column the number of each type of declaration): declarations 

| | Box No. Vm (i) Declaration as to the identity of the inventor : 

[ n Box No. Vm (ii) Declaration as to the applicant's entitlement, as at the international filing 

' ' date, to apply for and be granted a patent 

f~l Box No. Vm (iii) Declaration as to the applicant's entitlement, as at the international filing 

' 1 date, to claim the priority of the earlier application 

I I Box No. Vm (iv) Declaration of inventorship (only for the purposes of the designation of the . 

1 — 1 United States of America 

| | B 0 x No. VM (v) Declaration as to non-prejudiaal disclosures or exceptions to lack of novelty : 



Form PCT/RO/101 (second sheet) (January 2004) 
9382278.1 
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Box No. DC CHECK LIST; LANGUAGE OF FILING 



This international application contains: 



(a) in paper form, the following number of 
sheets: 



155 
16 
1 

202 



request (including 
declaration sheets) 
description (excluding 
sequence listings and/or 
tables related thereto) 

claims 

abstract 

drawings 

Sub-total number of sheets 
sequence listings 
tables related thereto 
(for both, actual number of sheets 
If filed in paper form, wl tether 
or not also filed in computer 
readable form; see (c) below) 
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POSH INTERACTING PROTEINS AND RELATED METHODS 

RELATED APPLICATIONS 

This application claims the benefit of priority of U.S. Provisional 
Application number 60/451,437 filed 3 March 2003; 60/452,284 filed 5 March 

5 2003; 60/456,640 filed 20 March 2003; 60/460,526 filed 3 April 2003; 60/464,285 
filed 21 April 2003; 60/469,462 filed 9 May 2003; 60/471,378 filed 15 May 2003; 
60/472,327 filed 20 May 2003; 60/474,706 filed 30 May 2003; 60/475,825 filed 3 
June 2003; 60/479,317 filed 17 June 2003; 60/480,376 filed 19 June 2003; 
60/480,215 filed 19 June 2003; 60/493,860 filed 8 August 2003; 60/503,931 filed 16 

10 September 2003; 60/455,760 filed 19 March 2003; 60/460,792 filed 4 April 2003; 
60/498,634 filed 28 August 2003; and a provisional application filed on March 2, 
2004, (Attorney Docket No. PROL-P79-024), in the name of Daniel N. Taglicht, Iris 
Alroy, Yuval Reiss, Liora Yaar, Danny Ben-Avraham, Shmuel Tuvia, and Tsvika 
Greener entitled "Posh Interacting Proteins and Related Methods"; a PCT 

15 application US03/35712 filed 10 November 2003; and a PCT application filed on 
February 5, 2004, (Attorney Docket No. PROL-PWO-039), in the name of Iris 
Alroy, Daniel Taglicht, Yuval Reiss, Liora Yaar, and Shmuel Tuvia entitled "Posh 
Associated Kinases and Related Methods". The teachings of the referenced 
Applications are incorporated herein by reference in their entirety. 

20 

BACKGROUND 

Potential drug target validation involves determining whether a DNA, RNA 
or protein molecule is implicated in a disease process and is therefore a suitable 
target for development of new therapeutic drugs. Drug discovery, the process by 

25 which bioactive compounds are identified and characterized, is a critical step in the 
development of new treatments for human diseases. The landscape of drug 
discovery has changed dramatically due to the genomics revolution. DNA and 
protein sequences are yielding a host of new drug targets and an enormous amount 
of associated information. 

30 The identification of genes and proteins involved in various disease states or 

key biological processes, such as inflammation and immune response, is a vital part 
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of the drug design process. Many diseases and disorders could be treated or 
prevented by decreasing the expression of one or more genes involved in the 
- molecular etiology of the condition if the appropriate molecular target could be 
identified and appropriate antagonists developed. For example, cancer, in which one 
5 or more cellular oncogenes become activated and result in the unchecked 
progression of cell cycle processes, could be treated by antagonizing appropriate cell 
cycle control genes. Furthermore many human genetic diseases, such as 
Huntington's disease, and certain prion conditions, which are influenced by both 
genetic and epigenetic factors, result from the inappropriate activity of a polypeptide 
0 as opposed to the complete loss of its function. Accordingly, antagonizing the 
aberrant function of such mutant genes would provide a means of treatment. 
Additionally, infectious d iseases such as HIV have been successfully treated with 
molecular antagonists targeted to specific essential retroviral proteins such as HIV 
protease or reverse transcriptase. Drug therapy strategies for treating such diseases 
5 and- disorders have frequently employed molecular antagonists which target the 
polypeptide product of the disease gene(s). However, the discovery of relevant gene 
or protein targets is often difficult and time consuming. 

One area of particular interest is the identification of host genes and proteins 
that are co-opted by viruses during the viral life cycle. The serious and incurable 
10 nature of many viral diseases, coupled with the high rate of mutations found in many 
viruses, makes the identification of antiviral agents a high priority for the 
improvement of world health. Genes and proteins involved in a viral life cycle are 
also appealing as a subject for investigation because such genes and proteins will 
typically have additional activities in the host cell and may play a role in other non- 
25 viral disease states. 

Other areas of interest include the identification of genes and proteins 
involved in cancer, apoptosis and neural disorders (particularly those associated with 
apoptotic neurons, such as Alzheimer's disease). 

It would be beneficial to identify proteins involved in one or more of these 
30 processs for use in, among other things, drug screening methods. Additionally, once 
a protein involved in one or more processes of interest has been identified, it is 
possible to identify proteins that associate, directly or indirectly, with the initially 
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identified protein. Knowledge of interactors will provide insight into protein 
assemblages and pathways that participate in disease processes, and in many cases 
an interacting protein will have desirable properties for the targeting of therapeutics. 
In some cases, an interacting protein will already be known as a drug target, but in a 
5 different biological context. Thus, by identifying a suite of proteins that interact 
with an initially identified protein, it is possible to identify novel drug targets and 
new uses for previously known therapeutics. 

SUMMARY 

10 This application provides isolated, purified or recombinant complexes 

comprising a POSH polypeptide and one or more POSH-associated protein (POSH- 
AP). In certain aspects, the POSH-AP comprises a polypeptide selected from the 
group consisting of: PKA, SNX1, SNX3, ATP6V0C, PTPN12, PPP1CA, GOSR2, 
CENTB1, DDEF1, ARF1, ARF5, PACS-1, EPS8L2, HERPUD1, UNC84B, 

1 5 MSTP028, GOCAP, EDF3S3, SRA1, CBL-B, RALA, SIAH1, SMN1, SMN2, 

SYNE1, TTC3, VCY2IP1 and UBE2N (UBC13). In other aspects, the POSH-AP 
comprises a polypeptide selected from the group consisting of: ARHV (Chp), 
WASF1, HIP55, SPG20, HLA-A, and HLA-B. In further aspects, the POSH-AP 
comprises one or more polypeptides set forth in Table 8. In certain embodiments the 

20 POSH polypeptide is a human POSH polypeptide. 

In certain embodiments, this application provides isolated, purified or 
recombinant complexes comprising a HERPUD1 polypeptides and a ubiquitin 
ligase, examples of the ubiquitin ligase include CBL-B, TTC3, and SIAH1. 

In certain embodiments, the application provides methods for identifying 

25 agents that modulates an activity of a POSH polypeptide or POSH-AP, comprising 
identifying an agent that disrupts a complex of a POSH polypeptide and a POSH- 
AP, wherein an agent that disrupts such a complex is an agent that modulates an 
activity of the POSH polypeptide or the POSH-AP. 

In yet other embodiments, the application provides methods of identifying an 

30 antiviral agent, comprising identifying a test agent that disrupts a complex 

comprising a POSH polypeptide and a POSH-AP and evaluating the effect of the 
test agent on either a pro-infective or pro-replicative function of a virus is an 
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antiviral agent, wherein an agent inhibits such a function of a virus is an antiviral 
agent. In certain embodiments the POSH-AP is selected from the group consisting • 
of: PKA, SNX1, SNX3, PTPN12, GOSR2, CENTB1, ARF1, ARF5, PACS-1, 
EPS8L2, HERPUD1, SMN1, SMN2, UNC84B, MSTP028, GOCAP, CBL-B, 
5 SYNE1, UBE2N (UBC13), SIAH1, TTC3, WASF1, HIP55, RALA, and SPG20. 
Examples of such viruses include for example, envelope viruses such as the Human 
Immunodeficiency Virus, the West Nile Virus, and the Moloney Murine Leukemia 
Virus (MMuLV). 

In other embodiments, the application provides methods of identifying an 
10 anti-apoptotic agent, comprising identifying a test agent that disrupts a complex 
comprising a POSH polypeptide and a POSH-AP and evaluating the effect of the 
test agent on apoptosis of a cell wherein an agent that decreases apoptosis of the cell 
is an anti-apoptotic agent. In yet other embodiments, the application provides 
methods of identifying an anti-cancer agent, comprising identifying a test agent that 
1 5 disrupts a complex comprising a POSH polypeptide and a POSH-AP and evaluating 
the effect of the test agent on proliferation or survival of a cancer cell, wherein an 
agent that decreases proliferation or survival of a cancer cell is an anti-cancer agent. 
Examples of the POSH-AP include PKA, SNX1, PTPN12, PPP1CA, ARF1, ARF5, 
CENTB1, EPS8L2, EEF3S3, CBL-B, RALA, SIAH1, TTC3, ATP6V0C, and 
20 VCY2IP1 . In certain embodiments, the cancer is a POSH-associated cancer. 

In certain aspects, the application provides methods of identifying an agent 
that inhibits trafficking of a protein through the secretory pathway, comprising 
identifying a test agent that disrupts a complex comprising a POSH polypeptide and 
a POSH-AP and evaluating the effect of the test agent on the trafficking of a protein 
25 through the secretory pathway wherein an agent that disrupts localization of said 
POSH-AP is an agent that inhibits trafficking of a protein through the secretory 
pathway. In certain embodiments, the protein is a myristoylated protein. In yet 
other embodiments, the protein is a viral protein. In alternative embodiments, the 
protein is associated with a neurological disorder such as for example the amyloid 
30 beta precursor protein. 

In yet other embodiments, the application provides methods of identifying an 
agent that inhibits the progression of a neurological disorder, comprising identifying 
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a test agent that disrupts a complex comprising a POSH polypeptide and a POSH- 
AP evaluating the effect of the test agent on the trafficking of a protein through the 
secretory pathway wherein an agent that disrupts localization of a POSH-AP is an 
agent that inhibits progression of a neurological disorder. In certain aspects the 

5 POSH-AP is HERPUD1 . 

In yet other embodiments, this application provides methods of treating a 
viral infection in a subject in need thereof, comprising administering an agent that 
inhibits a POSH-AP in an amount sufficient to inhibit the viral infection. The agent 
is one that: inhibits a kinase activity of the POSH-AP; inhibits expression of the 

1 0 POSH-AP; inhibits the ubiquitin ligase activity of the POSH-AP; inhibits the 

phosphatase activity of the POSH-AP; inhibits the GTPase activity of the POSH- 
AP; and inhibits the ubiquitination of the POSH-AP. In certain embodiments, the 
POSH-AP comprises a polypeptide selected from the group consisting of: PKA, 
SNX1, SNX3, SMN1, SMN2, PTPN12, GOSR2, CENTB1, ARF1, ARF5, PACS-1, 

15 EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, CBL-B, SYNE1, UBE2N 
(UBC13), SIAH1, TTC3, WASF1, HIP55, RALA, and SPG20. In certain aspects, 
the agent may be an siRNA construct, a small molecule, an antibody, or an antisense 
construct. 

In certain embodiments, the agent is an siRNA construct comprising a 
20 nucleic acid sequence that hybridizes to an mRNA encoding the POSH-AP. 

Examples include siRNA constructs that inhibit the expression of HERPUD1 or 
MSTP028. Examples of siRNA constructs that inhibit the expression of HERPUD1 
include: 5'GGAAGUUCUUCGGAACCUdTdT-3' and 

5'- dTdTCCCUUCAAG AAGCCUUGGA-5 ' . Examples of siRNA constructs that 
25 inhibit the expression of MSTP028 include: 5'-AAGTGCTCACCGACAGTGAAG- 

3' and 5 ' - AAGAT ACTT ATG AGCCTTTCT-3 ' . 

In other aspects, the agents may be a small molecule inhibitor is selected from 

among the following categories: adenosine cyclic monophosphorothioate, 

isoquinoUnesulfonamide, piperazine, piceatannol, and ellagic acid. In alternative 
30 embodiments, the agents may be a small molecule inhibitor that inhibits the ligase 

activity of a POSH polypeptide or inhinbits the ubiquitination of a POSH-AP. 

Examples of such small molecules include, for example: 
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In certain embodiments, the application provides packaged pharmaceuticals 
for treating viral infections, comprising: a pharmaceutical composition comprising 
an inhibitor of a POSH-AP and a pharmaceutical^ acceptable carrier and 

instructions for use. 

In certain embodiments, the application provides methods of treating or 
preventing a POSH associated cancer in a subject comprising administering an agent 
that inhibits a POSH-AP to a subject in need thereof, wherein said agent treats or 



9372369 1 



6- 



Attorney Docket No.: PROL-PWO-024 

P C TV U S O HI-/ 01 6 3 O 8 



prevents cancer. The POSH-AP comprises a polypeptide selected from the group 
consisting of: PKA, SNX1, PTPN12, PPP1CA, CENTB1, ARF1, ARF5, EPS8L2, 
EEF3S3, CBL-B, RALA, SIAH1, TTC3, ATP6V0C, and VCY2IP1. 

In yet other aspects, the application provides methods of treating a 

5 neurological disorder comprising administering an agent to a subject in need thereof, 
wherein said agent either inhibits the Ubiquitin ligase activity of POSH or inhibits 
the ubquitination of a POSH-AP. Examples of the POSH-AP include: PTPN12, 
DDEF1, EPS8L2, HERPUD1, GOCAP, CBL-B, SIAH1, SMN1, SMN2, TTC3, 
SPG20, SNX1 , and ARF1 . 

10 Examples of the neurological disorders include Alzheimer's disease, Parkinson's 

disease, Huntington's disease, schizophrenia, Niemann-Pick's disease, and prion- 
associated diseases. In certain aspects, the agent is selected from the group 
consisting of: an siRNA construct, a small molecule, an antibody, and an aritisense 
construct. Examples of the small molecules include: 
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In certain aspects, the disclosure provides methods of treating viral hepatitis 
in a subject in need thereof. Such a method may comprise administering an 
effective amount of an agent that inhibits POSH or disrupts an interaction between 
POSH and a dynamin, preferably dynaminn. In certain embodiments, the subject 

JO has a viral hepatitis caused by HBV or HCV. 

In certain aspects, the disclosure provides methods of inhibiting a 
hepatotrophic virus or a method for treating a disease associated with a hepatrophic 
virus, comprising administering an effective amount of an agent, wherein said agent 
inhibits POSH or an interaction between POSH and dynamin. In certain 

1 5 embodiments, the hepatrophic virus is selected from the group consisting of HAV, 
HBV, HCV, HDV, and HEV. The hepatotrophic virus associated disease may be, 
for example, viral hepatitis or hepatocellular carcinoma. An agent for any of the 
above methods may include, for example, a nucleic acid agent that decreases the 
level of POSH in cells of the subject (e.g., an antisense oligonucleotide, an RNAi 

20 construct, a DNA enzyme, a ribozyme) or small molecule inhibitors of POSH, as 
well as antibodies or other binding agents that bind to a surface of POSH or 
dynamin that participates in a POSH-dynamin interaction. An agent may be any of 
the following: a small molecule, an antibody, a fragment of an antibody, a 
peptidomimetic, and a polypeptide. Examples of small molecules include: 



9372369_1 



Attorney Docket No.: PROL-PWO-024 

P C TV" U S 01 *+/ O 6 3 01 8 



10 



STRUCTURE 


MW 


CAS number 


F 


384.2 


14567-55-4 


Br 0 

Br 


389.5 


414908-38-0 



15 



In certain embodiments, the application provides methods for inhibiting an 
HBV infection in a subject in need thereof, comprising administering an effective 
amount of a POSH inhibitor, wherein the HBV infection is inhibited in the subject. 
In additional embodiments, the disclosure provides methods for treating an HBV 
infection in a patient, comprising administering an effective amount of an agent that 
inhibits POSH or decreases the level of POSH protein or nucleic acid in an infected 
cell. An agent may be, for example, an RNAi construct that inhibits the expression 
of POSH. Optionally the RNAi construct is 20-25 nucleotides in length and 
optionally it is selected from any one of SEQ ID NOS: 15, 16, 18, 19, 21, 22, 24, 
and 25. The RNAi may be formulated as a liposome. An agent may be a small 
molecule inhibitor of POSH ubiquitin ligase activity, as disclosed herein. Examples 
of small molecule inhibitors of POSH include: 
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MW 



384.2 




389.5 



CAS number 
14567-55-4 



414908-38-0 



in certain aspects, the disclosure provides a method for treating an HBV 



infection in a patient, comprising administering an effective amount of an antisense 
oligonucleotide sufficient to bind a nucleic acid molecule, which nucleic acid 
molecule encodes a POSH polypeptide. 

In certain embodiments, the application provides methods for inhibiting an 
HBV infection by administering an effective amount of a compound of the formula: 




10 



In additional embodiments, the application provides methods for treating an 
HBV infection by administering an effective amount of a compound of the formula: 
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Br 

In certain aspects, the disclosure provides methods for inhibiting the' 
maturation of a lentivirus by modulating the activity of a Vpu polypeptide. In 
preferred embodiments, maturation of the lentivirus is inhibited by inhibiting the 
5 transport and/or assembly of viral particles in the TGN and from the TGN to the 
plasma membrane. A preferred lentivirus for application of such a method is the 
human immunodeficiency virus. 

In certain aspects, the disclosure provides methods of inhibiting viral 
infection comprising administering an agent to a subject in need thereof, wherein 
10 said agent inhibits the interaction between a POSH polypeptide and Vpu. 

In certain aspects, the disclosure provides methods for identifying a target 
polypeptide for antiviral therapy, the method comprising: a) selecting a test 
polypeptide known to localize or predicted to localize to the trans Golgi network; b) 
inhibiting an activity of the test polypeptide in a cell infected with a viral construct 
15 under conditions where, but for the inhibition of the activity of the test polypeptide, 
viral particles are released from the cell; and c) deterauning whether viral particles 
are released from the cell, wherein, if inhibiting the activity of the test polypeptide in 
the cell inhibits the release of viral particles from the cell, the test polypeptide is a 
target polypeptide for antiviral therapy. In a preferred embodiment, the test 
20 polypeptide is Vpu. Vpu activity may be inhibited, for example, by siRNA, 
antisense or other nucleic acid based method. 

In certain aspects, the disclosure provides isolated, purified or recombinant 
complexes comprising a POSH polypeptide and a Vpu polypeptide. The POSH 
polypeptide may comprise, for example, a POSH SH3 domain, or a polypeptide at 
25 least 80% identical to such an SH3 domain. An antiviral agent may be selected 
based on its ability to disrupt a POSH- Vpu complex. 

The practice of the present application will employ, unless otherwise 
indicated, conventional techniques of cell biology, cell culture, molecular biology, 
9372369J 
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transgenic biology, microbiology, recombinant DNA, and immunology, which are 
within the skill of the art. Such techniques are explained fully in the literature. See, 
for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, 
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, 

5 Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 
ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. 
D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames 
& S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, 
Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A 

0 Practical Guide To Molecular Cloning ( 1 984); the treatise, Methods In Enzymolpgy 
(Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. 
Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In 
Enzymology, Vols. 154 and 155 (Wu et al. eds.), Immunochemical Methods In Cell 
And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); 
15 Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. 
Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

Other features and advantages of the application will be apparent from the 
following detailed description, and from the claims. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows human POSH coding sequence (SEQ ID NO:l). 
Figure 2 shows human POSH amino acid sequence (SEQ ID NO:2). 
Figure 3 shows human POSH cDNA sequence (SEQ ID NO:3). 
25 Figure 4 shows 5' cDNA fragment of human POSH (public gi:10432611; 

SEQ ID NO:4). 

Figure 5 shows N terminus protein fragment of hPOSH (public gi:10432612; 
SEQ ID NO:5). 

Figure 6 shows 3' mRNA fragment of hPOSH (public gi:7959248; SEQ ID 

30 NO:6). 

Figure 7 shows C terminus protein fragment of hPOSH (public gi:7959249; 
SEQ ID NO:7). 
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Figure 8 shows human POSH full mRNA, annotated sequence. 
Figure 9 shows domain analysis of human POSH. 

Figure 10 is a diagram of human POSH nucleic acids. The diagram shows 
the full-length POSH gene and the position of regions amplified by RT-PCR or 
5 targeted by siRNA used in figure 1 1 . 

Figure 11 shows effect of knockdown of POSH mRNA by siRNA duplexes. 
HeLa S S-6 c ells w ere t ransfected w ith s iRNA a gainst Lamin A/C ( lanes 1 , 2 ) o r 
POSH (lanes 3-10). POSH siRNA was directed against the coding region (153 - 
lanes 3, 4; 155 - lanes 5, 6) or the 3'UTR (157 - lanes 7, 8; 159 - lanes 9, 10). Cells 
10 were harvested 24 hours post-transfection, RNA extracted, and POSH mRNA levels 
compared by RT-PCR of a discrete sequence in the coding region of the POSH gene 
(see figure 10). GAPDH is used an RT-PCR control in each reaction. 

Figure 12 shows that POSH affects the release of VLP from cells. A) 
Phosphohimages of SDS-PAGE gels of immunoprecipitations of 35 S pulse-chase 
15 labeled Gag proteins are presented for cell and viral lysates from transfected HeLa 
cells that were either untreated or treated with POSH RNAi (50 nM for 48 hours). 
The time during the chase period (1, 2, 3, 4, and 5 hours after the pulse) are 
presented from left to right for each image. 

Figure 13 shows release of VLP from cells at steady state. Hela cells were 
20 transfected with an HTV-encoding plasmid and siRNA Lanes 1, 3 and 4 were 
transfected with wild-type HlV-encoding plasmid. Lane 2 was transfected with an 
HIV-encoding plasmid which contains a point mutation in p6 (PTAP to ATAP). 
Control siRNA (lamin A/C) was transfected to cells in lanes 1 and 2. siRNA to 
TsglOl was transfected in lane 4 and siRNA to POSH in lane 3. 
25 Figure 14 shows mouse POSH mRNA sequence (public gi: 10946921; SEQ 

ID NO: 8). 

Figure 15 shows mouse POSH Protein sequence (Public gi:10946922; SEQ 
ID NO: 9). 

Figure 1 6 s hows D rosophila m elanogaster P OSH m RNA s equence (public 
30 gi: 17737480; SEQ ID NO: 10). 

Figure 17 shows Drosophila melanogaster POSH protein sequence (public 

gi: 1 773748 1 ; SEQ ID NO: 1 1 ). 
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Figure 1 8 shows POSH domain analysis. 

Figure 19 shows that human POSH has ubiquitin ligase activity. 
^ - -Figure 20 shows that human POSH co-immunoprecipitates with RAC1 . 

Figure 21 shows that POSH knockdown results in decreased secretion of 

5 phospholipase D ("PLD"). 

Figure 22 shows effect of hPOSH on Gag-EGFP intracellular distribution. 
Figure 23 shows intracellular distribution of HIV-1 Nef in hPOSH-depleted 

cells. 

Figure 24 shows intracellular distribution of Src in hPOSH-depleted cells. 
10 Figure 25 shows intracellular distribution of Rapsyn in hPOSH-depleted 

cells. 

Figure 26 shows that POSH reduction by siRNA abrogates West Nile virus 
infectivity. 

Figure 27 shows that POSH knockdown decreases the release of extracellular 

15 MMuLV particles. 

Figure 28 shows that knock-down of human POSH entraps HIV virus 
particles in intracellular vesicles. HIV virus release was analyzed by electron 
microscopy following siRNA and full-length HIV plasmid transfection. Mature 
viruses were secreted by cells transfected with HIV plasmid and non-relevant siRNA 

20 (control, bottom panel). Knockdown of TsglOl protein resulted in a budding defect, 
the viruses that were released had an immature phenotype (top panel). Knockdown 
of hPOSH levels resulted in accumulation of viruses inside the cell in intracellular 

vesicles (middle panel). 

Figure 29A shows siRNA-mediated reduction of MSTP028 expression 
25 inhibits HIV virus-like particle production (Experiment 1). 

Figure 29B shows siRN A-mediated reduction of MSTP028 expression 
inhibits HIV virus-like particle production (Experiment 2). 

Figure 30 shows putative PKA phosphorylation sites in hPOSH. Amino acid 
sequence of hPOSH (70 residues per line)-. Motifs of the low stringency RxxS/T 
30 type are underlined. The high stringency motif R/KR/KxS/T is bordered. Putative 
S/T phosphorylation sites are highlighted in green. Color-coding of domains: Red - 
RING, Blue - SH3, Green - putative Rac-1 Binding Domain. 
9372369_1 
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Figure 31 shows that phosphorylation of hPOSH regulates binding of GTP- 
loaded Rac-1. Bacterially expressed hPOSH (1 ug) (POSH) or GST (1 ug) <NS) 
were phosphorylated. Subsequently, GTPyS loaded or unloaded recombinant Rac-1 
(0.2 ug) was added to hPOSH or GST. Bound racl was isolated as described in 
5 materials and methods and samples separated by SDS-PAGE on a 12% gel and 
immunobloted with anti-Rac-1. Input is 0.25 ug of Rac-1. 

Figure 32 shows domain analysis of various POSH-APs. 

Figure 33 shows siRNA-mediated reduction in HERPUD1 expression 

reduces HIV maturation. 
10 Figure 34A shows that endogenous Herp levels are reduced in H153 cells. 

HI 53 (POSH-RNAi) and HI 87 (control RNAi) cells were transfected with a plasmid 
encoding Flag-ubiquitin. Total cell lysates (A) or Flag-imrhunoprecipitated material 
(B) were separated on 10% SDS-PAGE and immunoblotted with anti-Herp 
antibodies. 

15 Figure 34B shows that exogenous Herp levels and its ubiquitination are 

reduced in POSH-depleted cells. H153 and HI 87 cells were co-transfected with 
Herp or control plasmids and a plasmid encoding Flag-ubiquitin (indicated above the 
figure). Total (A) and flag-immunoprecipitated material (B) were separated on 10% 
SDS-PAGE and immunoblotted with anti-Herp antibodies. 

20 Figure 35 shows that the compounds CAS number 14567-55-4 and CAS 

number 414908-38-0 (lanes 7 and 8) inhibit HBV production. 

Figure 36 provides the nucleic acid and amino acid sequences of POSH-APs. 

DETAILED DESCRIPTION OF THE APPLICATION 
25 1. Definitions 

The term "binding" refers to a direct association between two molecules, due 
to, for example, covalent, electrostatic, hydrophobic, ionic and/or hydrogen-bond 
interactions under physiological conditions. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid 
30 sequence encoding a polypeptide with a second amino acid sequence defining a 
domain foreign to and not substantially homologous with any domain o f the first 
amino acid sequence. A chimeric protein may present a foreign domain which is 
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found (albeit in a d ifferent p rotein) in an organism which also expresses the first 
protein, or it may be an "interspecies", "intergenic", etc. fusion of protein structures 
expressed by different kinds of organisms. 

The terms "compound", "test compound" and "molecule" are used herein 
5 interchangeably and are meant to include, but are not limited to, peptides, nucleic 
acids, carbohydrates, small organic molecules, natural product extract libraries, and 
any other molecules (including, but not limited to, chemicals, metals and 
organometallic compounds). 

The phrase "conservative amino acid substitution" refers to grouping of 
10 amino acids on the basis of certain common properties. A functional way to define 
common properties between individual amino acids is to analyze the normalized 
frequencies of amino acid changes between corresponding proteins of homologous 
organisms (Schulz, G. E. and R. H. Schirmer., Principles of Protein Structure, 
Springer-Verlag). According to such analyses, groups of amino acids may be 
15 defined where amino acids within a group exchange preferentially with each other, 
and therefore resemble each other most in their impact on the overall protein 
structure (Schulz, G. E. and R. H. Schirmer, Principles of Protein Structure, 
Springer-Verlag). Examples of amino acid groups defined in this manner include: 
(i) a charged group, consisting of Glu and Asp, Lys, Arg and His, 
20 (ii) a positively-charged group, consisting of Lys, Arg and His, 

(iii) a negatively-charged group, consisting of Glu and Asp, 

(iv) an aromatic group, consisting of Phe, Tyr and Trp, 

(v) a nitrogen ring group, consisting of His and Trp, 

(vi) a large aliphatic nonpolar group, consisting of Val, Leu and He, 
25 (vii) a slightly-polar group, consisting of Met and Cys, 

(viii) a small-residue group, consisting of Ser, Thr, Asp, Asn, Gly, Ala, Glu, Gin 
and Pro, 

(ix) an aliphatic group consisting of Val, Leu, He, Met and Cys, and 

(x) a small hydroxyl group consisting of Ser and Thr. 

30 In addition to the groups presented above, each amino acid residue may form 

its own group, and the group formed by an individual amino acid may be referred to 



9372369J 



-16- 



Attorney Docket No.: PROL-PWO-024 

P C T / U S O H-y O B 3 O S 



sijnply by the one and/or three letter abbreviation for that amino acid commonly 
used in the art. 

, , ' _A "conserved residue" is an amino acid that is relatively invariant across a 
range of similar proteins. Often conserved residues will vary only by being replaced 
5 with a similar amino acid, as described above for "conservative amino acid 
substitution". 

The term "domain" as used herein refers to a region of a protein that 
comprises a particular structure and/or performs a particular function. 

The term "envelope virus" as used herein refers to any virus that uses cellular 
1 0 membrane and/or any organelle membrane in the viral release process. 

"Homology" or "identity" or "similarity" refers to sequence similarity 
between two peptides or between two nucleic acid molecules. Homology and 
identity can each be determined by comparing a position in each sequence which 
may b e a ligned for p urposes o f c omparison. W hen an equivalent p osition i n t he 
1 5 compared sequences is occupied by the same base or amino acid, then the molecules 
are i dentinal at that p osition; when the e quivalent s ite occupied b y the s ame o r a 
similar amino acid residue (e.g., similar in steric and/or electronic nature), then the 
molecules can be referred to as homologous (similar) at that position. Expression as 
a percentage of homology/similarity or identity refers to a function of the number of 
20 identical or similar amino acids at positions shared by the compared sequences. A 
sequence which is "unrelated" or "non-homologous" shares less than 40% identity, 
though preferably less than 25% identity with a sequence of the present application. 
In comparing two sequences, the absence of residues (amino acids or nucleic acids) 
or presence of extra residues also decreases the identity and homology/similarity. 
25 The term "homology" describes a mathematically based comparison of 

sequence similarities which is used to identify genes or proteins with similar 
functions or motifs. The nucleic acid and protein sequences of the present 
application may be used as a "query sequence" to perform a search against public 
databases to, for example, identify other family members, related sequences or 
30 homologs. Such searches can be performed using the NBLAST and XBLAST 
programs (version 2.0) of Altschul, et al. (1990) J Mol. Biol. 215:403-10. BLAST 
nucleotide searches can be performed with the NBLAST program, score=100, 
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wordlength=12 to obtain nucleotide sequences homologous to nucleic acid 
molecules of the application. BLAST protein searches can be performed with the 
XBLAST program, score=50, wordlength=3 to obtain amino acid sequences 
homologous to protein molecules of the application. To obtain gapped alignments 
5 for comparison purposes, Gapped BLAST can be utilized as described in Altschul et 
al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and 
Gapped BLAST programs, the default parameters of the respective programs (e.g., 
XBLAST and BLAST) can be used. See http://www.ncbi.nlm.nih.gov. 

As used herein, "identity" means the percentage of identical nucleotide or 
10 amino acid residues at corresponding positions in two or more sequences when the 
sequences are aligned to maximize sequence matching, i.e., taking into account gaps 
and insertions. Identity can be readily calculated by known methods, including but 
not limited to those described in (Computational Molecular Biology, Lesk, A. M., 
ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and 
15 Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G, eds., Humana 
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G, 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM 
20 J. Applied Math., 48: 1073 (1988). Methods to determine identity are designed to 
give the largest match between the sequences tested. Moreover, methods to 
determine identity are codified in publicly available computer programs. Computer 
program methods to determine identity between two sequences include, but are not 
limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 
25 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S. F. et al., J. 
Molec. Biol. 215: 403-410 (1990) and Altschul et al. Nuc. Acids Res. 25: 3389-3402 
(1997)). The BLAST X program is publicly available from NCBI and other sources 
(BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, Md. 20894; 
Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990). The well known Smith 
30 Waterman algorithm may also be used to determine identity. 

The term "isolated", as used herein with reference to the subject proteins and 
protein complexes, refers to a preparation of protein or protein complex that is 
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essentially free from contaminating proteins that normally would be present with the 
protein or complex, e.g., in the cellular milieu in which the protein or complex is 
found endogenously. Thus, an isolated protein complex is isolated from cellular 
components t hat n ormally would " contaminate" o r i nterfere w ith t he s tudy o f t he 
5 complex in isolation, for instance while screening for modulators thereof. It is to be 
understood, however, that such an "isolated" complex may incorporate other 
proteins the modulation of which, by the subject protein or protein complex, is being 
investigated. 

The term "isolated" as also used herein with respect to nucleic acids, such as 
10) DNA or RNA, refers to molecules in a form which does not occur in nature. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments 
which are not naturally occurring as fragments and would not be found in the natural 
state. 

Lentiviruses include primate lentiviruses, e.g., human immunodeficiency 

15 virus types 1 and 2 (HIV-l/HIV-2); simian immunodeficiency virus (SIV) from 
Chimpanzee (SIVcpz), Sooty mangabey (SIVsmm), African Green Monkey 
(SrVagm),Syke's monkey (SlVsyk), M andrill ( SIVmnd) and Macaque (SlVmac). 
Lentiviruses also include feline lentiviruses, e.g., Feline immunodeficiency virus 
(FIV); Bovine lentiviruses, e.g., Bovine immunodeficiency virus (BIV); Ovine 

20 lentiviruses, e.g., Maedi/Visna virus (MVV) and Caprine arthritis encephalitis virus 
(CAEV); and Equine lentiviruses, e.g., Equine infectious anemia virus (EIAV). All 
lentiviruses express at least two additional regulatory proteins (Tat, Rev) in addition 
to Gag, Pol, and Env proteins. Primate lentiviruses produce other accessory proteins 
including Nef, Vpr, Vpu, Vpx, and Vif. Generally, lentiviruses are the causative 

25 agents of a variety of disease, including, in addition to immunodeficiency, 
neurological degeneration, and arthritis. Nucleotide sequences of the various 
lentiviruses can be found in Genbank under the following Accession Nos. (from J. 
M. Coffin, S. H. Hughes, and H. E. Varmus, "Retroviruses" Cold Spring Harbor 
Laboratory Press, 1 99,7 p 804): 1 ) HIV-1: K03455, M 19921, K02013, M3843 1 , 

30 M38429, K02007 and Ml 7449; 2) HIV-2: M30502, J04542, M30895, J04498, 
M15390, M31113 and L07625; 3) SIV:M29975, M30931, M58410, M66437, 
L06042, M33262, M19499, M32741, M31345 and L03295; 4) FIV: M25381, 
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M36968 and Ul 1820; 5)BIV. M32690; 6) E1AV: M16575, M87581 and U01866; 
6) Visna: M10608, M51543, L06906, M60609 and M60610; 7) CAEV: M33677; 
arid 8X Ovine lentivirus M3 1646 and M34193. Lentiviral DNA can also be obtained 
from the American Type Culture Collection (ATCC). For example, feline 
5 immunodeficiency virus is available under ATCC Designation No. VR-2333 and 
VR-3112. Equine infectious anemia virus A is available under ATCC Designation 
No. VR-778. Caprine arthritis-encephalitis virus is available under ATCC 
Designation No. VR-905. Visna virus is available under ATCC Designation No. 
VR-779. 

10 As used herein, the term "nucleic acid" refers to polynucleotides such as 

deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The 
term should also be understood to include, as equivalents, analogs of either RNA or 
DNA made from nucleotide analogs, and, as applicable to the embodiment being 
described, single-stranded (such as sense or antisense) and double-stranded 

15 polynucleotides. 

The term "maturation" as used herein refers to the production, post- 
translational processing, assembly and/or release of proteins that form a viral 
particle. Accrodingly, this includes the processing of viral proteins leading to the 
pinching off of nascent virion from the cell membrane. 
20 A "POSH nucleic acid" is a nucleic acid comprising a sequence as 

represented in any of SEQ ID Nos:l, 3, 4, 6, 8, and 10 as well as any of the variants 
described herein. 

A "POSH polypeptide" or "POSH protein" is a polypeptide comprising a 
sequence as represented in any of SEQ ID Nos: 2, 5, 7, 9 andl 1 as well as any of the 
25 variations described herein. 

A "POSH-associated protein" or "POSH-AP" refers to a protein capable of 
interacting with and/or binding to a POSH polypeptide. Generally, the POSH-AP 
may interact directly or indirectly with the POSH polypeptide. Preferred POSH-APs 
include those provided in Table 7. Other preferred POSH-APs include those listed 
30 in Table 8. Examples of these and other POSH-APs are provided throughout. 
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The terms peptides, proteins and polypeptides are used interchangeably 

herein. 

The term "purified protein" refers to a preparation of a protein or proteins 
which are preferably isolated from, or otherwise substantially free of, other proteins 
normally associated with the protein(s) in a cell or cell lysate. The term 
"substantially free of other cellular proteins" (also referred to herein as "substantially 
free of other contaminating proteins") is denned as encompassing individual 
preparations of each of the component proteins comprising less than 20% (by dry 
weight) contaminating protein, and preferably comprises less than 5% contaminating 
protein. Functional forms of each of the component proteins can be prepared as 
purified preparations by using a cloned gene as described in the attached examples. 
By "purified", it is meant, when referring to component protein preparations used to 
generate a reconstituted protein mixture, that the indicated molecule is present in the 
substantial absence of other biological macromolecules, such as other proteins 
1 5 (particularly other proteins which may substantially mask, diminish, confuse or alter 
the characteristics of the component proteins either as purified preparations or in 
their function in the subject reconstituted mixture). The term "purified" as used 
herein preferably means at least 80% by dry weight, more preferably in the range of 
85% by weight, more preferably 95-99% by weight, and most preferably at least 
20 99.8% by weight, of biological macromolecules of the same type present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight 
of less than 5000, can be present). The term "pure" as used herein preferably has the 
same numerical limits as "purified" immediately above. 

A "recombinant nucleic acid" is any nucleic acid that has been placed 
25 adjacent to another nucleic acid by recombinant DNA techniques. A "recombined 
nucleic acid" also includes any nucleic acid that has been placed next to a second 
nucleic acid by a laboratory genetic technique such as, for example, tranformation 
and integration, transposon hopping or viral insertion. In general, a recombined 
nucleic acid is not naturally located adjacent to the second nucleic acid. 
30 The term "recombinant protein" refers to a protein of the present application 

which is produced by recombinant DNA techniques, wherein generally DNA 
encoding the expressed protein is inserted into a suitable expression vector which is 
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in turn used to transform a host cell to produce the heterologous protein. Moreover, 
the phrase "derived from", with respect to a recombinant gene encoding the 
recombinant protein is meant to include within the meaning of "recombinant 
protein" those proteins having an amino acid sequence of a native protein, or an 
5 amino acid sequence similar thereto which is generated by mutations including 
substitutions and deletions of a naturally occurring protein. 

A "RING domain" or "Ring Finger" is a zinc-binding domain with a defined 
octet of cysteine and histidine residues. Certain RING domains comprise the 
consensus sequences as set forth below (amino acid nomenclature is as set forth in 
10 Table 1): Cys Xaa Xaa Cys Xaa, 0 - 20 Cys Xaa His Xaa 2 - 5 Cys Xaa Xaa Cys Xaa l3 - 5 o 
Cys Xaa Xaa Cys or Cys Xaa Xaa Cys Xaa 10 - 20 Cys Xaa His Xaa 2 . 5 His Xaa Xaa 
Cys Xaa, 3 . 5 o Cys Xaa Xaa Cys. Certain RING domains are represented as amino 
acid sequences that are at least 80% identical to amino acids 12-52 of SEQ ID NO: 2 
and is set forth in SEQ ID No: 26. Preferred RING domains are 85%, 90%, 95%, 
15 98% and, most preferably, 100% identical to the amino acid sequence of SEQ ID 
NO: 26. Preferred RING domains of the application bind to various protein partners 
to form a complex that has ubiquitin ligase activity. RING domains preferably 
interact with at least one of the following protein types: F box proteins, E2 ubiquitin 
conjugating enzymes and cullins. 
20 The term "RNA interference" or "RNAi" refers to any method by which 

expression of a gene or gene product is decreased by introducing into a target cell 
one or more double-stranded RNAs which are homologous to the gene of interest 
(particularly to the messenger RNA of the gene of interest). RNAi may also be 
achieved by introduction of a DNA:RNA hybrid wherein the antisense strand 
25 (relative to the target) is RNA. Either strand may include one or more modifications 
to the base or sugar-phosphate backbone. Any nucleic acid preparation designed to 
achieve an RNA interference effect is referred to herein as an siRNA construct. 
Phosphorothioate is a particularly common modification to the backbone of an 
siRNA construct. 

30 "Small molecule" as used herein, is meant to refer to a composition, which 

has a molecular weight of less than about 5 kD and most preferably less than about 
2.5 kD. Small molecules can be nucleic acids, peptides, polypeptides, 
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peptidornimetics, carbohydrates, lipids or other organic (carbon containing) or 
inorganic molecules. Many pharmaceutical companies have extensive libraries of 
chemical and/or biological mixtures comprising arrays of small molecules, often 
fungal, bacterial, or algal extracts, which can be screened with any of the assays of 

5 the application. 

An " SH3" o r "Src H omology 3 " d omain i s a p rotein d omain o f generally 
about 60 amino acid residues first identified as a conserved sequence in the non- 
catalytic part of several cytoplasmic protein tyrosine kinases (e.g., Src, Abl, Lck). 
SH3 domains mediate assembly of specific protein complexes via binding to 
10 proline-rich peptides. Exemplary SH3 domains are represented by amino acids 137- 
192, 199-258, 448-505 and 832-888 of SEQ ID NO:2 and are set forth in SEQ ID 
Nos: 27-30. In certain embodiments, an SH3 domain interacts with a consensus 
sequence of RXaaXaaPXaaX6P (where X6, as defined in table 1 below, is a 
hydrophobic amino acid). In certain embodiments, an SH3 domain interacts with 
15 one or more of the following sequences: P(T/S)AP, PFRDY, RPEPTAP, 
RQGPKEP, RQGPKEPFR, RPEPTAPEE and RPLPVAP. 

As used herein, the term "specifically hybridizes" refers to the ability of a 
nucleic acid probe/primer of the application to hybridize to at least 12, 15, 20, 25, 
30, 35, 40, 45, 50 or 100 consecutive nucleotides of a POSH sequence, or a 
20 sequence complementary thereto, or naturally occurring mutants thereof, such that it 
has less than 15%, preferably less than 10%, and more preferably less than 5% 
background hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) 
other than the POSH gene. A variety of hybridization conditions may be used to 
detect specific hybridization, and the stringency is determined primarily by the wash 
25 stage of the hybridization assay. Generally high temperatures and low salt 
concentrations give high stringency, while low temperatures and high salt 
concentrations give low stringency. Low stringency hybridization is achieved by 
washing in, for example, about 2.0 x SSC at 50 °C, and high stringency is acheived 
with about 0.2 x SSC at 50 °C. Further descriptions of stringency are provided 
30 below. 

As applied to polypeptides, "substantial sequence identity" means that two 
peptide sequences, when optimally aligned, such as by the programs GAP or 
9372369_1 
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BESTFIT using default gap which share at least 90 percent sequence identity, 
preferably at least 95 percent sequence identity, more preferably at least 99 percent 
sequence identity or more. Preferably/residue positions which are not identical 
differ by conservative amino acid substitutions. For example, the substitution of 
amino acids having similar chemical properties such as charge or polarity are not 
likely to effect the properties of a protein. Examples include glutamine for 
asparagine or glutamic acid for aspartic acid. 

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such duplicate genes may be 
identical or may have certain modifications, including nucleotide substitutions, 
additions or deletions, which all still code for polypeptides having substantially the 
same activity. 

A "virion" is a complete viral particle; nucleic acid and capsid (and a lipid 
envelope in some viruses. A "viral particle" may be incomplete, as when produced 
by a cell transfected with a defective virus (e.g., an HIV virus-like particle system). 



Table 1: Abbreviations for classes of amino acids* 



Symbol 


Category 


Amino Acids 
Represented 


XI 


Alcohol 


Ser,Thr 


X2 


Aliphatic 


He, Leu, Val 


Xaa 


Any 


Ala, Cys, Asp, Glu, Phe, 
Gly, His, lie, Lys, Leu, 
Met, Asn, Pro, Gin, Arg, 
Ser, Thr, Val, Trp, Tyr 


X4 


Aromatic 


Phe, His, Trp, Tyr 


X5 


Charged 


Asp, Glu, His, Lys, Arg 
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X6 


Hydrophobic 


Ala, Cys, Phe, Gly, His, 
lie, Lys, Leu, Met, Thr, 
Val, Trp, Tyr 


X7 


Negative 


Asp, Glu 


X8 


Polar 


Cys, Asp, Glu, His, Lys, 
Asn, Gin, Arg, Ser, Thr 


X9 


Positive 


His, Lys, Arg 


XIO 


Small 


Ala, Cys, Asp, Gly, Asn, 
Pro, Ser, Thr, Val 


XI 1 


Tiny 


Ala, Gly, Ser 


X12 


Tumlike 


Ala, Cys, Asp, Glu, Gly, 
His, Lys, Asn, Gin, Arg, 
Ser, Thr 


X13 


Asparagine-Aspartate 


Asn, Asp 



* Abbreviations as adopted from http://smart.embl- 
heidelberg.de/SMART_DATA/alignments/consensus/grouping.html. 



2. Overview 

In certain aspects, the application relates to the discovery of novel 
5 associations between POSH proteins and other proteins (termed POSH-APs), and 
related methods and compositions. In certain aspects, the application relates to 
novel associations among certain disease states, POSH nucleic acids and proteins, 
and POSH-AP nucleic acids and proteins. 

In certain aspects, by identifying proteins associated with POSH, and 
10 particularly human POSH, the present application provides a conceptual link 
between the POSH-APs and cellular processes and disorders associated with POSH- 
APs, and POSH itself. Accordingly, in certain embodiments of the disclosure, 
agents that modulate a POSH-AP may now be used to modulate POSH functions 
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and disorders associated with POSH function, such as viral disorders, POSH- 
associated cancers, and POSH-associated neural disorders. Additionally, test agents 
may be_screened for an effect on a POSH-AP and then further tested for an effect on 
a POSH function or a disorder associated with POSH function. Likewise, in certain 
5 embodiments of the disclosure, agents that modulate POSH may now be used to 
modulate POSH-AP functions and disorders associated with POSH-AP -function, 
including a variety of cancers. Additionally, test agents may be screened for an 
effect on POSH and then further tested for effect on a POSH-AP function or a 
disorder associated with POSH-AP function. In further aspects, the application 
10 provides nucleic acid agents (e.g., RNAi probes, antisense nucleic acids), antibody- 
related agents, small molecules and other agents that affect POSH function, and the 
use of same in modulating POSH and/or POSH-AP activity: 

POSH intersects with and regulates a wide range of key cellular functions 
that may be manipulated by affecting the level of and/or activity of POSH 
15 polypeptides or POSH-AP polypeptides. Many features of POSH, and particularly 
human POSH, are described in PCT patent publications WO03/095971A2 
(application no. WO2002US0036366) and WO03/078601A2 (application no. 
WO2003US0008194) the teachings of which are incorporated by reference herein. 

As described in the above-referenced publications, native human POSH is a 
20 large polypeptide containing a RING domain and four SH3 domains. POSH is a 
ubiquitin ligase (also termed an "E3" enzyme); the RING domain mediates 
ubiquitination of, for example, the POSH polypeptide itself. POSH interacts with a 
large number of proteins and participates in a host of different biological processes. 
As demonstrated in this disclosure, POSH associates with a number of different 
25 proteins in the cell. POSH co-localizes with proteins that are known to be located in 
the trans-Golgi network, implying that POSH participates in the trafficking of 
proteins in the secretory system. The term "secretory system" should be understood 
as referring to the membrane compartments and associated proteins and other 
molecules that are involved in the movement of proteins from the site of translation 
30 to a location within a vacuole, a compartment in the secretory pathway itself, a 
lysosome or endosome or to a location at the plasma membrane or outside the cell. 
Commonly cited examples of compartments in the secretory system include the 
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endoplasmic reticulum, the Golgi apparatus and the cis and trans Golgi networks. In 
addition, Applicants have demonstrated that POSH is necessary for proper secretion, 
localization or processing of a variety of proteins, including phospholipase D, HIV 
Gag, HIV Nef, Rapsyn and Src. Many of these proteins are myristoylated, 
indicating that POSH plays a general role in the processing and proper localization 
of myristoylated proteins. N-myristoylation is an acylation process, which results in 
covalent attachment of myristate, a 14-carbon saturated fatty acid to the N-terminal 
glycine of proteins (Farazi et al., J. Biol. Chem. 276: 39501-04 (2001)). N- 
myristoylation occurs co-translationaly and promotes weak and reversible protein- 
membrane interaction. Myristoylated proteins are found both in the cytoplasm and 
associated with membrane. Membrane association is dependent on protein 
configuration, i.e., surface accessibility of the myristoyl group may be regulated by 
protein modifications, such as phosphorylation, ubiquitination etc. Modulation of 
intracellular transport of myristoylated proteins in the application includes effects on 
transport and localization of these modified proteins. 

As described herein, POSH and POSH-APs are involved in viral maturation, 
including the production, post-translational processing, assembly and/or release of 
proteins in a viral particle. Accordingly, viral infections may be ameliorated by 
inhibiting an activity (e.g., ubiquitin ligase activity or target protein interaction) of 
POSH or a POSH-AP (e.g., inhibition of kinase activity or ubiquitin ligase activity), 
and in preferred embodiments, the virus is a retroid virus, an RNA virus or an 
envelope virus, including HIV, Ebola, HBV, HCV, HTLV, West Nile Virus (WNV) 
or Moloney Murine Leukemia Virus (MMuLV). Additional viral species are 
described in greater detail below. In certain instances, a decrease of a POSH 
function is lethal to cells infected with a virus that employs POSH in release of viral 
particles. 

In certain aspects, the application describes an hPOSH interaction with Rac, 
a small GTPase and the POSH associated kinases MLK, MKK and JNK Rho, Rac 
and Cdc42 operate together to regulate organization of the actin cytoskeleton and the 
MLK-MKK-JNK MAP kinase pathway (referred to herein as the "JNK pathway" or 
"Rac-JNK pathway" (Xu et al., 2003, EMBO J. 2: 252-61). Ectopic expression of 
mouse POSH ("mPOSH") activates the JNK pathway and causes nuclear 
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localization of NF-kB. Overexpression of mPOSH in fibroblasts stimulates 
apoptosis. (Tapon et al. (1998) EMBO J. 17:1395-404). In Drosophila, POSH may 
interact with, or otherwise influence the signaling of, another GTPase, Ras. 
(Schnorr et al. (2001) Genetics 159: 609-22). The JNK pathway and NF-kB 
5 regulate a variety of key genes involved in, for example, immune responses, 
inflammation, cell proliferation and apoptosis. For example, NF-kB regulates the 
production of interleukin 1, interleukin 8, tumor necrosis factor and many cell 
adhesion molecules. NF-kB has both pro-apoptotic and anti-apoptotic roles in the 
cell (e.g., in FAS-induced cell death and TNF-alpha signaling, respectively). NF-kB 
1 0 is negatively regulated, in part, by the inhibitor proteins IkBcc and LcBp (collectively 
. termed "IkB"). Phosphorylation of IkB permits activation and nuclear localization 
of NF-kB. Phosphorylation of IkB triggers its degradation by the ubiquitin system. 
In an additional embodiment, a POSH polypeptide promotes nuclear localization of 
NF-kB. In further embodiments, manipulation of POSH levels and/or activities may 
1 5 be used to manipulate apoptosis. By upregulating POSH or a POSH-AP, apoptosis 
may be stimulated in certain cells, and this will generally be desirable in conditions 
characterized by excessive cell proliferation (e.g., in certain cancers). By 
downregulating POSH or a POSH-AP, apoptosis may be diminished in certain cells, 
and this will generally be desirable in conditions characterized by excessive cell 
20 death, such as myocardial infarction, stroke, degenerative diseases of muscle and 
nerve (particularly Alzheimer's disease), and for organ preservation prior to 
transplant. In a further embodiment, a POSH polypeptide associates with a vesicular 
trafficking complex, such as a clathrin- or coatomer- containing complex, and 
particularly a trafficking complex that localizes to the nucleus and/or Golgi 
25 apparatus. 

As described in WO03/078601A2 (application no. WO2003US0008194), 
POSH is overexpressed in a variety of cancers, and downregulation of POSH is 
associated with a decrease in proliferation in at least one cancer cell line. 
Accordingly, agents that modulate POSH itself or a POSH-AP may be used to treat 
30 POSH associated cancers. POSH associated cancers include those cancers in which 
POSH is overexpressed and/or in which downregulation of POSH leads to a 
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decrease in the proliferation or survival of cancer cells. POSH-associated cancers 
are described in more detail below. In addition, it is notable that many proteins 
shownjierein to be affected by POSH downregulation are themselves involved in 
cancers. Phospholipase D and SRC are both aberrantly processed in a POSH- 
5 impaired cell, and therefore modulation of POSH and/or a POSH-AP may affect the 
wide range of cancers in which PLD and SRC play a significant role. 

As described in WO03/095971A2 (application no. WO2002US0036366) and 
WO03/078601A2 (application no. WO2003US0008194), POSH polypeptides 
function as E3 enzymes in the ubiquitination system. Accordingly, downregulation 
10 or upregulation of POSH ubiquitin ligase activity can be used to manipulate 
biological processes that are affected by protein ubiquitination. Modulation of 
POSH ubiquitin ligase activity may be used to affect POSH-APs and related 
biological processes, and likewise, modulation of POSH-APs may be used to affect 
POSH ubiquitin ligase activity and related processes. Downregulation or 
15 upregulation may be achieved at any stage of POSH formation and regulation, 
including transcriptional, translational or post-translational regulation. For example, 
POSH transcript levels may be decreased by RNAi targeted at a POSH gene 
sequence. As another example, POSH ubiquitin ligase activity may be inhibited by 
contacting POSH with an antibody that binds to and interferes with a POSH RING 
20 domain or a domain of POSH that mediates interaction with a target protein (a 
protein that is ubiquitinated at least in part because of POSH activity). As a further 
example, small molecule inhibitors of POSH ubiquitin ligase activity are provided 
herein. As another example, POSH activity may be increased by causing increased 
expression of POSH or an active portion thereof. POSH, and POSH-APs that 
25 modulate POSH ubiquitin ligase activity may participate in biological processes 
including, for example, one or more of the various stages of a viral lifecycle, such as 
viral entry into a cell, production of viral proteins, assembly of viral proteins and 
release of viral particles from the cell. POSH may participate in diseases 
characterized by the accumulation of ubiquitinated proteins, such as dementias (e.g., 
30 Alzheimer's and Pick's), inclusion body myositis and myopathies, polyglucosan 
body myopathy, and certain forms of amyotrophic lateral sclerosis. POSH may 
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participate in diseases characterized by excessive or inappropriate ubiquitination 

and/or protein degradation. 

3. POSH Associated Proteins 

In certain aspects, the application relates to the discovery of novel 
5 associations between POSH proteins and other proteins (termed POSH-APs), and 
related methods and compositions. In certain aspects, the application relates to 
novel associations among certain disease states, POSH nucleic acids and proteins, 
and POSH-AP nucleic acids and proteins. POSH-APs may interact either directly or 
indirectly with POSH. In certain embodiments, a POSH-AP binds directly to a 
10 POSH polypeptide. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with one subunit of Protein Kinase A (PKA; cAMP-dependent 
protein kinase). In one aspect, the application relates to the discovery that POSH 
binds directly with PRKAR1A. This interaction was identified by Applicants in a 
1 5 yeast 2-hybrid assay. Exemplary PKA subunits may include, but are not limited to, 
a regulatory subunit (e.g., PRKAR1A) and a catalytic subunit (e.g., PRKACA or 
PRKACB). PKA is an essential enzyme in the signaling pathway of the second 
messenger cyclic AMP (cAMP). Through phosphorylation of target proteins, PKA 
controls many biochemical events in the cell including regulation of metabolism, ion 
20 transport, and gene transcription. The PKA holoenzyme is composed of two 
regulatory and two catalytic subunits and dissociates from the regulatory subunits 
upon binding of cAMP. The PKA enzyme is inactive in the absence of cAMP. 
Activation of PKA occurs when two cAMP molecules bind to each regulatory 
subunit, eliciting a reversible conformational change that releases active catalytic 
25 subunits. 

A number of human PKA subunits have been characterized, including a 
regulatory subunit (type I alpha: PRKAR1) and two catalytic subunits (C-alpha: 
PRKACA; and C-beta: PRKACB). Boshart et al. identified the regulatory subunit 
PRKAR1 of PKA as the product of the TSE1 locus (Boshart, M et al. (1991) Cell 
30 66: 849-859). The evidence consisted of concordant expression of PRKAR1 mRNA 
and TSE1 genetic activity, high resolution physical mapping of the two genes on 
human chromosome 17, and the ability of transfected PRKAR1 cDNA to generate a 
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phenocopy of TSEl-mediated extinction. Jones et al. independently established 
identity of TSE1 and the Rl-alpha subunit (Jones, KW et al. (1991) Cell 66: 861- 
872). 

Other than a role of PKA in metabolism, PKA subunits have recently been 
5 implicated in multiple diseases. For example, a specific role for localized PRKAR1 
has been demonstrated in human T lymphocytes, where type I PKA localizes to the 
activated TCR complex and is required for attenuation of signals propagated through 
this complex (Skalhegg, BS et al. (1992) J Biol Chem 267:15707-15714; Skalhegg, 
BS et al. (1994) Science 263: 84-87). The importance of type I PKA-mediated 
10 effects in attenuation of T cell replication has led to its consideration as a therapeutic 
target in combined variable immunodeficiency (CVI) and acquired immune 
deficiency syndrome (AIDS). Furthermore, type I PKA in t cells may also serve as 
a potential therapeutic target in systemic lupus erythematosis (SLE). For example, a 
series of recently published articles has uncovered the first human disease mapping 
15 to a PKA subunit-Camey complex (Casey, M et al. (2000) J Clin Invest 106: R31- 
38; Kirschner, LS et al. (2000) Nat Genet 26: 89-92). Carney complex (CNC) is a 
multiple neoplasia syndrome characterized by spotty skin pigmentation, cardiac and 
skin myxomas, endocrine tumors, and psammomatous melanotic schwannomas. 
CNC maps to two genomic loci, 17q24 and 2pl6. Familial cases mapping to the 
20 I7q24 locus reveal deletions/mutations in the PRKAR1 coding exons leading to 
frameshifts and premature stop codons— no mRNA and protein from the mutant 
alleles has been observed. 

Accordingly, in certain aspects of the present disclosure, POSH participates 
in the formation of PKA complexes, including human PKA-containing complexes. 
25 Certain POSH polypeptides may be involved in disorders of the immune system, 
e.g., autoimmune disorders. Certain POSH polypeptides may be involved in the 
regulation of T-cell activation. In certain aspects, POSH participates in the 
ubiquitination of PI3K. In certain aspects, PKA subunit polypeptides participate in 
POSH-mediated processes. 
30 Additionally, the disclosure relates in part to the discovery that PKA 

phosphorylates POSH, and further, that this phosphorylation inhibits the interaction 
of POSH with small GTPases, such as Rac. Small GTPases are important in 
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yesicular trafficking, and therefore the findings disclosed herein demonstrate that 
POSH phosphorylation regulates the formation of complexes between POSH and 
„ - proteins involved in the secretory system, such as Rac, TCL, TC10, Cdc42, Wrch-1, 
Rac2, Rac3 or RhoG. Applicants have shown that inhibition of PKA and POSH has 
5 similar effects, indicating that inhibition of PKA will achieve an effect similar to 
that of inhibition of POSH. However, given the effect of PKA on POSH Afteraction 
with proteins in the secretory pathway, it is expected that PKA regulates the timing 
of cyclical interactions that are needed to effect vesicular trafficking. Accordingly, 
it is expected that significant inhibition or activation of PKA will cause a disruption 

10 in POSH function. 

The term "PKA subunit" is used herein to refer to a full-length human PKA 
subunit which includes a regulatory subunit (e.g., PRKAR1A) and a catalytic 
subunit (e.g., PRKACB or PRKACA), as well as an alternative PKA subunit 
composed of separate PKA subunit sequences (e.g., nucleic acid sequences) that 
15 may be a splice variant. The term "PKA subunit" is used herein to refer as well to 
various naturally occurring PKA subunit homologs, as well as functionally similar 
variants and fragments that retain at least 80%, 90%, 95%, or 99% sequence identity 
to a naturally occurring PKA subunit (e.g., SEQ ID NOs: 264-265, 111-122, 395- 
396). The term specifically includes human PKA subunit nucleic acid and amino 
20 acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with human UNC84B, a human homolog of C. elegans Unc- 
84. Accordingly, the application provides complexes comprising POSH and 
UNC84B. In one aspect, the application relates to the discovery that POSH binds 
25 directly with UNC84B. This interaction was identified by Applicants in a yeast 2- 
hybrid assay. In C. elegans, Unc-84 is involved in the cellular positioning of the 
nucleus. UNC84/SUN is positioned at the nuclear membrane and recruits 
Syne/ANC-1, which directly tethers the nuclear envelope to the actin cytoskeleton. 
Accordingly, in certain aspects, POSH participates in formation of a UNC84 
30 complexes, including human UNC84B-containing complexes, and in the 
connections between the nucleus and the cytoskeleton. In certain aspects, UNC84 
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polypeptides participate in POSH-mediated processes. See, for example, Starr and 
Han, 2003, J Cell Sci 116(Pt 2):21 1-6. 

The term UNC84 is used herein to refer to various naturally occurring Unc- 
84 homologs, as well as functionally similar variants and fragments that retain at 
5 least 8 0%, 9 0%, 9 5%, o r 9 9% se quence i dentity t o a n aturally o ccurring UNC84 
(e.g., SEQ ID NOs: 314, 211-213 ). The term specifically includes human UNC84B 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with human GOCAP1. Certain GOCAP1 polypeptides are 
10 cytoplasmic proteins associated with the Golgi complex. Accordingly, the 
applicationprovidescomplexescomprisingPOSHandGOCAPl. In one aspect, the 
application relates to the discovery that POSH binds directly with GOCAP1. This 
interaction was identified by Applicants in a yeast 2-hybrid assay. In certain 
aspects, these complexes associate with the Golgi complex. GOCAP1 is 
15 synonymous with GCP60. Certain GCP60 polypeptides interact with the Golgi 
complex integral membrane protein, giantin. Certain GCP60 polypeptides are 
involved in the maintenance of the Golgi structure through interaction with giantin 
and affect protein transport between the endoplasmic reticulum and the Golgi 
complex (Sohda, M, et al. (2001) J Biol Chem 276:45298-306). In certain aspects, 
20 GOC AP 1 polypeptides participate in POSH-mediated processes. 

The term GOCAP1 is used herein to refer to various naturally occurring 
GOCAP1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
GOCAP1 (e.g., SEQ ID NOs: 240-243, 61-68). The term specifically includes 
25 human GOCAP1 nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with human PTPN12, a protein tyrosine phosphatase. 
Accordingly, the application provides complexes comprising POSH and PTPN12. 
30 In one aspect, the application relates to the discovery that POSH binds directly with 
PTPN12. This interaction was identified by Applicants in a yeast 2-hybrid assay. 
9372369 1 

-33- 



Attorney Docket No.: PROL-PWO-024 

P C TV' U Stttf/ O B 3 O El! 



PTPN12 polypeptides are synonymous with the protein tyrosine phosphatase, PTP- 
PEST. PTP-PEST polypeptides contain proline-rich sequences and are rich in 
proline, glutamate, serine, and threonine residues at their carboxyl terminus, features 
characteristic of PEST motifs. Certain PTP-PEST polypeptides interact with 
5 paxillin, a scaffolding protein to which focal adhesion proteins bind, leading to the 
formation of the focal adhesion contact (Shen, Y et al. (1998) J Biol Chem 
273:6474-81). Certain PTP-PEST polypeptides associate with the focal adhesion 
protein, pl30cas (Garton, AJ et al. (1997) Oncogene 15:877-85). Certain PTP- 
PEST polypeptides have also been shown to associate with JAK2, PSTPIP and 
10 WASP, gelsolin, cell adhesion kinase beta, Csk, Hef 1 or Sin , Hic-5, or She (See, 
for example, Horsch, et al (2001) Mol Endocrinol 15:2182-96; Cote, et al (2002) J 
Biol Chem 277:2973-86; Chellaiah, et al (2001) J Biol Chem 276:47434-44; Lyons, 
et al (2001) J Biol Chem 276:24422-31; Davidson, et al (1997) J Biol Chem 
21:1077-88; Cote, JF et al (1998) Biochemistry 37:13128-37; Nishiya, N (1999) J 
15 Biol Chem 274:9847-53; Habib, T et al (1994) J Biol Chem 269:25243-6). Certain 
PTP-PEST polypeptides are involved in inactivation of the Ras pathway (Davidson, 
D and Veillette, A (2001) EMBO J 20:3414-26). The expression level of certain 
PTP-PEST polypeptides can modulate the activity of the GTPase, Racl (Sastry, et al 
(2002) J Cell Sci 115(Pt 22): 4305-16). Certain PTP-PEST polypeptides are 
20 involved in the regulation of cell motility (Garton, AJ and Tonks, NK (1999) J Biol 
Chem 274:3811-8; Angers-Loustau, et al (1999) J Cell Biol 144:1019-31; and 
Sastry, et al. (2002) J Cell Sci 155(Pt 22): 4305-16). Accordingly, certain POSH 
polypeptides are involved in inactivation of the Ras pathway. Certain POSH 
polypeptides are involved in the regulation of cell motility. 
25 Certain PTP-PEST polypeptides are involved in amyloidp-induced neuronal 

dystrophy, a pathological hallmark of Alzheimer's disease (Grace, EA and 
Busciglio, J (2003) J Neurosci. 23:493-502). Accordingly, certain POSH 
polypeptides may be involved in Alzheimer's disease. Certain PTP-PEST 
polypeptides function as negative regulators of lymphocyte activation (Davidson, D 
30 and Veillette, A (2001) EMBO J 20:3414-26). Accordingly, certain POSH 
polypeptides may be involved in the regulation of lymphocyte activation. In certain 
aspects, PTPN12 polypeptides participate in POSH-mediated processes. 
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The term PTPN12 is used herein to refer to various naturally occurring 
PTPN12 homologs, as well as functionally similar variants and fragments that retain 
^ - at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring PTPN12 
(e.g., SEQ ID NOs: 266-268, 123-129). The term specifically includes human 
5 PTPN12 nucleic acid and amino acid sequences and the sequences presented in 
Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with HERPUD1, a "homocysteine-inducible, endoplasmic 
reticulum stress-inducible, ubiquitin-like domain member 1" protein. Accordingly, 

10 the application provides complexes comprising POSH and HERPUD1. In one 
aspect, the application relates to the discovery that POSH binds directly with 
HERPUD1 . This interaction was identified by Applicants in a yeast 2-hybrid assay. 
HERPUD1 is synonymous with Herp. In part, the present application relates to the 
discovery that a POSH-AP, HERPUD1, is involved in the maturation of an envelope 

15 virus, such as HIV. 

Certain HERPUD1 polypeptides are involved in JNK-mediated apoptosis, 
particularly in vascular endothelial cells, including cells that are exposed to high 
levels of homocysteine. Certain HERPUD1 polypeptides are involved in the 
Unfolded Protein Response, a cellular response to the presence of unfolded proteins 
20 in the endoplasmic reticulum. Certain HERPUD1 polypeptides are involved in the 
regulation, of sterol biosynthesis. Accordingly, certain POSH polypeptides are 
involved in the Unfolded Protein Response and sterol biosynthesis. 

In other aspects, certain HERPUD1 polypeptides enhance presenilin- 
mediated amyloid p-protein generation. For example, HERPUD1 polypeptides, 

25 when overexpressed in cells, increase the level of amyloid p generation, and it is 
observed that HERPUD1 polypeptides interact with the presenilin proteins, 
presenilin-1 and presenilin-2. (See Sai, X. et al (2002) J. Biol. Chem. 277:12915- 
12920). Accordingly, in certain aspects, POSH polypeptides may modulate the level 
of amyloid p generation. Additionally, POSH polypeptides may interact with 

30 presenilin 1 and presenilin 2. Therefore, it is believed certain POSH polypeptides 
modulate presenilin-mediated amyloid p generation. The accumulation of amyloid 
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p is one hallmark of Alzheimer's disease. Accordingly, these POSH polypeptides 
may be involved in the pathogenesis of Alzheimer's disease. At sites such as late 
intracellular compartment sites including the trans-Golgi network, certain mutant 
presenilin-2 polypeptides up-regulate production of amyloid 0 peptides ending at 
5 position 42 (A042). (See Iwata, H. et al (2001) J. Biol. Chem. 276: 21678-21685). 
Accordingly, POSH polypeptides regulate production of A042 through mutant 
presenilin-2 at late intracellular compartment sites including the trans-Golgi 
network. Furthermore, elevated homocysteine levels have been found to be a risk 
factor associated with Alzheimer's disease and cerebral vascular disease. Some risk 
1 0 factors, such as elevated plasma homocysteine levels, may accelerate or increase the 
severity of several central nervous system (CNS) disorders. Elevated levels of 
plasma homocysteine were found in young male patients with schizophrenia 
suggesting that elevated homocysteine levels could be related to the 
pathophysiology of aspects of schizophrenia (Levine, J. et al (2002) Am. J. 
15 Psychiatry 159:1790-2). Accordingly, certain POSH polypeptides may be involved 
in neurological disorders. Neurological disorders include disorders associated with 
increased levels of plasma homocysteine, increased levels of amyloid p production, 
or aberrant presenilin acitivity. Neurological disorders include CNS disorders, such 
as Alzheimer's disease, cerebral vascular disease and schizophrenia. Certain POSH 
20 polypeptides may be involved in cardiovascular diseases, such as thromboembolic 
vascular disease, and particularly the disease characteristics associated with 
hyperhomocysteinemia. See, for example, Kokame et al. 2000 J. Biol. Chem. 
275:32846-53; Zhang et al. 2001 Biochem Biophys Res Commun 289:718-24. 

The term HERPUD1 is used herein to refer to various naturally occurring 
25 HERPUD1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
HERPUD1 (e.g., SEQ ID NOs: 249-252, 77-86). The term specifically includes 
human HERPUD1 nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 

30 In certain aspects, the application relates to the discovery that a POSH 

polypeptide interacts with one or more Cbl-b polypeptides. Accordingly, the 
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application provides complexes comprising POSH and Cbl-b. In one aspect, the 
application relates to the discovery that POSH binds directly with Cbl-b. This 
interaction was identified by Applicants in a yeast 2-hybrid assay. Cbl-b 
polypeptides contain an amino-terminal variant SH2 domain, a RING finger, and a 
5 carboxyl-terminal proline-rich domain with potential tyrosine phosphorylation sites. 
Cbl-b is highly homologous to the mammalian Cbl and the nematode Sli-1 proteins. 
This application provides four Cbl-b variants and shows that the POSH polypeptide 
interacts with one or more of these variants. In one aspect, the POSH polypeptide 
interacts with a human Cbl-b (UniGene No.: Hs.3144). In another aspect, the POSH 
1 0 polypeptide i nteracts w ith a n a ltemative h uman Cbl-b ( UniGene N o. : H s.38 1 92 1 ) 
that may be a splice variant of Cbl-b. In yet another aspect, the POSH polypeptide 
interacts with a human Cbl-b polypeptide that is a splice variant represented by the 
amino acid sequence depicted in SEQ ID NO: 36 1 , which is encoded by the nucleic 
acid sequence depicted in SEQ ID NO: 359. In yet another aspect, the POSH 
15 polypeptide interacts with a human Cbl-b polypeptide that is a splice variant 
represented by the amino acid sequence depicted in SEQ ID NO: 398, which is 
encoded by the nucleic acid sequence depicted in SEQ ID NO: 360. 

Certain Cbl-b polypeptides have been shown to function as adaptor proteins 
by interacting with other signaling molecules, e.g., interaction with cell surface 
20 receptor tyrosine kinases, e.g., EGFR (Ettenberg, SA et al (2001) J Biol Chem 
276:77-84) or with proteins such as Syk (Elly, C et al (1999) Oncogene 18:1147- 
56), Crk-L (Elly, C et al (1999) Oncogene 18:1147-56), PI3K (Fang, D et al. (2001) 
J Biol Chem 16:4872-8), Grb2 (Ettenberg, SA et al (1999) Oncogene 18:1855-66), 
or Vav (Bustelo, XR et al. (1997) Oncogene 15:2511-20). Certain Cbl-b 
25 polypeptides have been demonstrated to interact directly with the nucleotide 
exchange factor, Vav (Bustelo, XR et al. ( 1997) Oncogene 1 5:2511-20). C ertain 
Cbl-b polypeptides have been shown to function as an E3 ubiquitin ligase that 
recognizes tyrosine phosphorylated substrates through its SH2 domain and through 
its RING domain, recruits a ubiquitin-conjugating enzyme, E2 (Joazeiro, C et al. 
30 (1 999) S cience 2 86:309-3 1 2) A dditionally, certain C bl-b p olypeptides have b een 
shown to associate directly with the p85 subunit of PI3K and to function as an E3 
ligase in the ubiquitination ofPBK (Fang, D et al. (2001) J Biol Chem 16:4872-8). 
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Certain Cbl-b polypeptides are negative regulators of T-cell activation. Cbl- 
b-deficient mice become very susceptible to experimental autoimmune 
'I encephalomyelitis (Chiang, YJ et al. (2000) Nature 403:216-220). Also, Cbl-b- 
deficient mice develop spontaneous autoimmunity (Bachmaier, K, et al (2000) 
5 Nature 403:21 1-216). Furthermore, Cbl-b is a major susceptibility gene for rat type 
1 diabetes mellitus (Yokoi, N et al (2002) Nature Genet. 31 :391-394). 

Accordingly, in certain aspects, POSH participates in the formation of Cbl-b 
complexes, including human Cbl-b-containing complexes. Certain POSH 
polypeptides may be involved in disorders of the immune system, e.g., autoimmune 
10 disorders. Certain POSH polypeptides may be involved in the regulation of T-cell 
activation. In certain aspects, POSH participates in the ubiquitination of PI3K. In 
certain aspects, Cbl-b polypeptides participate in POSH-mediated processes. 

The term Cbl-b is used herein to refer to full-length, human Cbl-b (UniGene 
No.: Hs.3144) as well as an alternative Cbl-b (UniGene No.: Hs.381921) composed 

15 of two separate Cbl-b sequences (e.g., nucleic acid sequences) that may be a splice 
variant. The term Cbl-b is used herein to refer as well to the human Cbl-b splice 
variant represented by the amino acid sequence of SEQ ID NO: 361, which is 
encoded by the nucleic acid sequence of SEQ ID NO: 359 and to the human Cbl-b 
splice variant represented by the amino acid sequence of SEQ ID NO: 398, which is 

20 encoded by the nucleic acid sequence of SEQ ID NO: 360. The term Cbl-b is used 
herein to refer as well to various naturally occurring Cbl-b homologs, as well as 
functionally similar variants and fragments that retain at least 80%, 90%, 95%, or 
99% sequence identity to a naturally occurring Cbl-b (e.g., SEQ ID NOs: 361, 398, 
227-230, 353-360 ). The term specifically includes human Cbl-b nucleic acid and 

25 amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with GOSR2. Accordingly, the application provides 
complexes comprising POSH and GOSR2. In one aspect, the application relates to 
the discovery that POSH binds directly with GOSR2. This interaction was 

30 identified by Applicants in a yeast 2-hybrid assay. Certain GOSR2 polypeptides are 
synonymous with GS27 (for Golgi SNARE of 27K) and are involved in trafficking 
membrane proteins between the endoplasmic reticulum and the Golgi and between 
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Golgi subcompartments such as between the cis-,medial- and trans-Golgi network. 
(See, for example, Lowe, SL et al (1997) Nature 389:881-4 and Bui, TD et al (1999) 
57:285-8). Accordingly, certain POSH polypeptides are involved in the trafficking 
of membrane proteins between the endoplasmic reticulum and the Golgi and 
5 between Golgi subcompartments. 

The term GOSR2 is used herein to refer to various naturally occurring 
GOSR2 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring GOSR2 
(e.g., SEQ ID NOs: 244-248, 69-76). The term specifically includes human GOSR2 
10 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with RALA. Accordingly, the application provides complexes 
comprising POSH and RALA. In one aspect, the application relates to the discovery 
that POSH binds directly with RALA. This interaction was identified by Applicants 
15 in a yeast 2-hybrid assay. RALA polypeptides are GTP-binding polypeptides. 
RALA polypeptides are members of the Ras family of proteins and are GTPases. 
Certain RALA polypeptides may be synonymous with RalA polypeptides. RalA 
polypeptides are small GTPases. RalA polypeptides have been shown to interact 
with phospholipase D and to effect phospholipase D activity. Additionally, RalA 
20 polypeptides may be involved in tumor formation and cell transformation. (See, for 
example, Kim, JH et al (1998) FEBS Lett 430:231-5; Aguirre-Ghiso, JA et al (1999) 
Oncogene 18:4718-25; Lu, Z et al (2000) Mol Cell Biol 20:462-7; Gildea, JJ et al 
(2002) Cancer Res 62:982-5; Lucas, L et al (2002) Int J Oncol 21 :477-85; and Xu, L 
et al (2003) Mol Cell Biol 23:645-54). Accordingly, certain POSH polypeptides 
25 may i nteract w ith P LD and m odulate i ts activity, and certain P OSH p olypeptides 
may be involved in tumor formation and cell transformation. In other aspects, 
certain RalA polypeptides interact with calmodulin and may be involved in 
calcium/calmodulin-mediated intracellular signaling pathways (Clough, RR et al 
(2002) J Biol Chem 277:28972-80). Certain RalA polypeptides are involved in 
30 controlling actin cytoskeletal remodeling and vesicle transport in mammalian cells. 
Certain RalA polypeptides interact with the exocyst complex, which is involved in 
exocytosis. (See, for example, Sugihara, K et al (2002) Nat Cell Biol 4:73-8; Polzin, 
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A et al (2002) Mol Cell Biol 22:1714-22; and Lipschutz, JH and Mostov, KE (2002) 
Curr Biol 12(6):R212-4). Accordingly, certain POSH polypeptides are involved in 
vesicle transport. 

The term RALA is used herein to refer to various naturally occurring RALA 
5 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring RALA (e.g., 
SEQ ID NOs: 269-270, 130-134). The term specifically includes human RALA 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
10 polypeptide interacts with SMN1 . Accordingly, the application provides complexes 
comprising POSH and SMN1. In one aspect, the application relates to the discovery 
that POSH binds directly with SMN1. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. SMN1 polypeptides are encoded by the nucleic acid of 
the survival motor neuron gene 1 (SMN1). Mutations in this gene (such as its 
15 homozygous absence) cause spinal muscular atrophy (SMA), a common autosomal 
recessive disorder characterized by degeneration of motor neurons in the spinal cord, 
leading to progressive paralysis with muscular atrophy. Accordingly, POSH may be 
involved in the pathogenesis of SMA. SMN1 is part of a multiprotein complex that 
is required for biogenesis of the Sm class of small nuclear ribonucleoproteins (Sm 
20 snRNPs). SMN1 associates with a number of proteins, such as Gemin2 to Gemin6, 
to form a large complex found in both the cytoplasm and in the nucleus. SMN1 also 
associates with Snurportin 1, an adaptor protein that recognizes the nuclear 
localization signal of Sm snRNPs. (See, for example, Lefebvre, S et al (1995) Cell 
80:155-65; Narayanan, U et al (2002) Hum Mol Genet 11:1785-95; Massenet, S et al 
25 (2002) 22:6533-41; and Monani, UR et al (1999) Hum Mol Genet 8:1177-83). 
Accordingly, certain POSH polypeptides may be involved in the biogenesis of 
snRNPs. Certain SMN1 polypeptides interact with the large nonstructural protein 
NS1 of the autonomous parvovirus minute virus of mice (MVM). NS1 is essential 
for viral replication, and it is a potent transcriptional activator (Young, PJ et al 
30 (2002) J Virol 76:3892-904). Certain SMN1 polypeptides interact with the protein 
NS2 of MVM. NS2 is also required for efficient viral replication. Certain SMN1 
polypeptides colocalize with NS2 in infected nuclei and at late times following 
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MVM infection. (See Young, PJ et al (2002) J Virol 76:6364-9). Accordingly, 
POSH polypeptides are involved in viral replication. 

. _The term SMN1 is used herein to refer to various naturally occurring SMN1 
homologs, as well as functionally similar variants and fragments that retain at least 
5 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SMN1 (e.g., 
SEQ ID NOs: 273-275, 142-146). The term specifically includes human" SMN1 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with SMN2. Accordingly, the application provides complexes 
10 comprising POSH and SMN2. In one aspect, the application relates to the discovery 
that POSH binds directly with SMN2. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. The SMN2 gene is an almost identical copy of the SMN1 
gene that causes SMA. A critical difference between the two genes is a 1 nucleotide 
base change inside exon 7 that affects the splicing pattern of the genes. The 
15 majority of the SMN2 transcript lacks exon 7. Certain SMN2 polypeptides 
influence the severity of SMA. (See, for example, Monani, UR et al (1999) Hum 
Mol Genet 8: 1177-83; Cartegni, L and Krainer, AR (2002) Nat Genet 30:377-84; 
and Feldkotter, M et al (2002) Am J Hum Genet 70: 358-68). Accordingly, certain 
POSH polypeptides may influence the severity of SMA. 
20 The term SMN2 is used herein to refer to various naturally occurring SMN2 

homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SMN2 (e.g., 
SEQ ID NOs: 276-280, 147-151). The term specifically includes human SMN2 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

25 In certain aspects, the application relates to the discovery that a POSH 

polypeptide interacts with SIAH1. Accordingly, the application provides complexes 
comprising POSH and SIAH1. In one aspect, the application relates to the discovery 
that POSH binds directly with SIAH1 . This interaction was identified by Applicants 
in a yeast 2-hybrid assay. Certain SIAH1 polypeptides bind ubiquitin-conjugating 

30 enzymes and target proteins for proteasome-mediated degradation. Certain SIAH1 

polypeptides are involved in targeting beta-catenin for degradation (Matsuzawa, S 

and Reed, JC (2001) Molec Cell 7: 915-926 and Liu, J et al (2001) Molec Cell 7: 
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927-936). Accordingly, certain POSH polypeptides are involved in the targeting of 
beta-catenin for degradation. Certain SIAH1 polypeptides are E3 ubiquitin ligases 
and regulate the ubiquitination and degradation of synaptophysin (Wheeler, TC et al. 
(2002) J Biol Chem 277: 10273-92). Accordingly, certain POSH polypeptides are 
involved in the ubiquitination and degardation of synaptophysin. Certain SIAH1 
polypeptides regulate the protein, DCC (deleted in colorectal cancer), via the 
ubiquitin-proteosome pathway (Hu, G et al. (1997) Genes Dev 11: 2701-14). 
Accordingly, certain POSH polypeptides are involved in the ubiquitination and 
degardation of DCC. Certain SIAH1 polypeptides are a target of activation of p53 
) and are upregulated by p53, and certain SIAH1 polypeptides are involved in 
apoptosis, tumor suppression, as well as vertebrate development (Maeda, A et al 
(2002) FEBS Lett 512: 223-226; Hu, G et al (1997) Genomics 46:103-111; and 
Nemani,M et al ( 1996) P roc Natl Acad Sci USA 93: 9039-9042). Accordingly, 
certain POSH polypeptides may be a target of p53 activation, and certain POSH 
1 5 polypeptides may be involved in apoptosis and tumor suppression. 

The term SIAH1 is used herein to refer to various naturally occurring SIAH1 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SIAH1 (e.g., 
SEQ ID NOs: 271-272, 135-141). The term specifically includes human SIAH1 
20 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with SYNE1. Accordingly, the application provides 
complexes comprising POSH and SYNE1. In one aspect, the application relates to 
the discovery that POSH binds directly with SYNE1. This interaction was identified 
25 by Applicants in a yeast 2-hybrid assay. SYNE1 polypeptides are synonymous with 
Syne-1, myne-1, and nesprin-1 polypeptides. Syne-1 polypeptides are associated 
with nuclear envelopes in skeletal, cardiac, and smooth muscle cells. Syne-1 
polypeptides contain multiple spectrin repeats. In muscle, myne-1 expression is 
observed in the inner nuclear envelope, and myne-1 has been shown to interact with 
30 the inner nuclear membrane protein lamin A/C. Syne-1 also associates with the 
nuclear envelope protein, emerin. Syne-1 polypeptides may be involved in 
maintaining nuclear organization and structural integrity, and certain Syne-1 
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polypeptides may be involved in the migration of myonuclei in myotubes and/or 
their anchoring at the postsynaptic apparatus. (See, for example, Apel et al (2000) J 
Biol Chem 275:31986-95; Zhang, Q et al (2001) J Cell Sci 114:4485-98; Zhang, Q 
et al (2002) Genomics 80:473-81; and Mislow, JM et al (2002) J Cell Sci 115 (Pt 

5 l):61-70). Accordingly, certain POSH polypeptides may interact with the lamin 
A/C polypeptides and/or emerin polypeptides. Also, certain POSH polypeptides 
may be involved in maintaining nuclear organization and structural integrity, and 
certain POSH polypeptides may be involved in the migration of myonuclei in 
myotubes and/or their anchoring at the postsynaptic apparatus. 

10 The term SYNE1 is used herein to refer to various naturally occurring 

SYNE1 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SYNE1 
(e.g., SEQ ID NOs: 295-307, 183-201). The term specifically includes human 
SYNE1 nucleic acid and amino acid sequences and the sequences presented in 

15 Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with TTC3. Accordingly, the application provides complexes 
comprising POSH and TTC3. In one aspect, the application relates to the discovery 
that POSH binds directly with TTC3 This interaction was identified by Applicants 
20 in a yeast 2-hybrid assay. Certain TTC3 polypeptides are synonymous with the 
proteins, TPRDI, TPRDII, TRPDHI, TPRD and DCRR1 and may be involved in the 
pathogenesis of certain characteristics of Down syndrome, such as morphological 
features, hypotonia, and mental retardation (Tsukahar, F et al (1996) J Biochem 
(Tokyo) 120: 820-827; Ohira, M et al (1996) DNA Res 3: 9-16; Dahmane, N et al 
25 (1998) Genomics 48: 12-23; and Eki, T et al (1997) DNA Seq 7:153-164). 

The term TTC3 is used herein to refer to various naturally occurring TTC3 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring TTC3 (e.g., SEQ 
ID NOs: 308-312, 202-207). The term specifically includes human TTC3 nucleic 
30 acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with VCY21P1. Accordingly, the application provides 
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complexes comprising POSH and VCY2IP1. In one aspect, the application relates 
to the discovery that POSH binds directly with VCY2IP1. This interaction was 

■ - identified by Applicants in a yeast 2-hybrid assay! VCY2IP1 is synonymous with 
VCY2IP-1 , which has been shown to interact with the testis-specific protein, VCY2. 

5 VCY2IP1 is also synonymous with C19orf5, which has been shown to interact with 
the tumor suppressor, RASSF1, suggesting a role for C19orf5 in apopttTsis and 
tumor suppression (In Vitro Cell Dev Biol Anim (2002) 38:582-94). C19orf5 also 
demonstrates a strong homology to microtubule-associated proteins (Genomics 
(2002) 79:124-6). Accordingly, POSH may play a role in apoptosis and tumor 

1 0 suppression. 

The term VCY2IP1 is used herein to refer to various naturally occurring 
VCY2IP1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
VCY2IP1 (e.g., SEQ ID NOs: 315-323, 214-222). The term specifically includes 
15 human VCY2IP1 nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with MSTP028. In one aspect, the application relates to the 
discovery that POSH binds directly with MSTP028. This interaction was identified 
20 by Applicants in a yeast 2-hybrid assay. In part, the present application relates to the 
discovery that a POSH-AP, MSTP028, is involved in the maturation of an envelope 
virus, such as HIV. Certain MSTP028 polypeptides contain one or more BTB/POZ 
domains that are generally involved in dimerization. Accordingly the application 
provides complexes comprising POSH and MSTP028, optionally in a dimeric form. 
25 The term MSTP028 is used herein to refer to various naturally occurring MSTP028 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring MSTP028 (e.g., 
SEQ ID NOs: 255-256, 90-94). The term specifically includes human M STP028 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 
30 In certain embodiments, the application relates to the discovery that a POSH 

polypeptide interacts with SNX1. Accordingly, the application provides complexes 
comprising POSH and SNX1. In one aspect, the application relates to the discovery 
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that POSH binds directly with SNX1. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. SNX1 is a member of the sorting nexin (SNX) protein 
family, which is implicated in regulating membrane traffic. SNX1 is a membrane 
associated protein that has been shown to be involved with targeting receptors to 
5 lysosomal degradation. SNX1 has been shown to bind to the C-terminal tail of the 
D5 dopamine receptor (Mol Cell Biol (1998) 18: 7278-87). Accordingly, in certain 
aspects POSH may associate with the D5 dopamine receptor. SNX1 is involved in 
regulating the targeting of internalized epidermal growth factor receptors for 
lysosomal degradation (Science ( 1996) 272:1008-1010). In certain aspects, POSH 
10 may be involved in targeting proteins for degradation to the lysosome. SNX1 has 
also been found to be involved in sorting PARI, a G-protein coupled receptor for 
thrombin (Mol Cell Biol (2002) 13:1965-76). It has further been demonstrated that 
SNX1 functions in regulating trafficking in the endosome compartment via 
recognition of phosphorylated phosphatidylinositol through the phox homology 
1 5 domain (PX domain) of SNX1 (Proc Natl Acad Sci (2002) 99:6767-72). 

The term SNX1 is used herein to refer to various naturally occurring SNX1 
homologs, as well as functionally similar variants and fragments that retain at least 
' 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SNX1 (e.g., SEQ 
ID NOs: 281-286, 152-161). The term specifically includes human SNX1 nucleic 
20 acid and amino acid sequences and the sequences presented in Figure 36. 

In additional embodiments, the application relates to the discovery that a 
POSH polypeptide interacts with SNX3. Accordingly, the application provides 
complexes comprising POSH and SNX3. In one aspect, the application relates to 
the discovery that POSH binds directly with SNX3. This interaction was identified 
25 by Applicants in a yeast 2-hybrid assay. SNX3 is also a member of the SNX protein 
family. SNX3 has been shown to associate with the early endosome through its PX 
domain, a domain capable of interaction with phosphatidylinositol-3 -phosphate (Nat 
Cell Biol (2002) 3:658-66). Accordingly, POSH may be involved in membrane 
traffic at the early endosome. 
30 The term SNX3 is used herein to refer to various naturally occurring SNX3 

homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SNX3 (e.g., SEQ 
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ID NOS: 287-290, 162-174). The term specifically includes human SNX3 nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In further embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with ATP6V0C. Accordingly, the application provides 
5 complexes comprising POSH and ASTP6V0C. In one aspect, the application relates 
to the discovery that POSH binds directly with ATP6V0C. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. ATP6V0C, vacuolar-H(+)- 
ATPase, is a large multimeric protein composed of at least twelve distinct subunits 
and it is involved in the H(+) transport across cellular membranes. ATP6V0C is 
10 synonymous with ATP6L. Treatment with anticancer agents has been shown to 
enhance ATP6L expression (Cytogenet Genome Res (2002) 97:111-5; J Biol Chem 
(2002) 277:36534-43). 

The term ATP6V0C is used herein to refer to various naturally occurring 
ATP6V0C homologs, as well as functionally similar variants and fragments that 
15 retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
ATP6V0C (e.g., SEQ ID NOs: 225-226, 345-351). The term specifically includes 
human ATP6V0C nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
20 polypeptide interacts with PPP1CA. Accordingly, the application provides 
complexes comprising POSH and PPP1CA. In one aspect, the application relates to 
the discovery that POSH binds directly with PPP1CA. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. PPP1CA is the protein 
phosphatase type 1 alpha catalytic subunit. The genetic and expression status of the 
25 PPP1CA gene was examined in 55 human cancer cell lines and found to be 
ubiquitously expressed and lacking in genetic variation, suggesting an essential role 
for PPP1CA in the growth of cancer cells (Int J Oncol (2001) 18:817-24). 

The term PPP1CA is used herein to refer to various naturally occurring 
PPP1CA homologs, as well as functionally similar variants and fragments that retain 
30 at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring PPP1CA 
(e.g., SEQ ID NOs: 261-263, 101-110). The term specifically includes human 
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PPP1CA nucleic acid and amino acid sequences and the sequences presented in 
Figure 36. 

" _The application further relates to the discovery that a POSH polypeptide 
interacts with DDEF1. Accordingly, the application provides complexes comprising 
5 POSH and DDEF1 . In one aspect, the application relates to the discovery that POSH 
binds directly with DDEF1. This interaction was identified by Applicants iff a yeast 
2-hybrid assay. DDEF1 is a putative candidate gene associated with Meckel-Gruber 
syndrome (MKS), the most common monogenic cause of neural tube defects (Hum 
Genet (2002) 111:654-61). 
10 The term DDEF1 is used herein to refer to various naturally occurring 

DDEF1 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring DDEF1 
(e.g., SEQ ID NOs: 233-237, 48-54). The term specifically includes human DDEF1 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 
1 5 In certain embodiments, the application relates to the discovery that a POSH 

polypeptide interacts with PACS-1. Accordingly, the application provides 
complexes comprising POSH and PACS-1. In one aspect, the application relates to 
the discovery that POSH binds directly with PACS-1. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. PACS-1 is a cytosolic sorting 
20 protein that directs localization of membrane proteins in the TGN/endosomal 
system. PACS-1 is a cytosolic protein involved in controlling the correct subcellular 
localization of integral membrane proteins that contain acidic cluster sorting motifs, 
such as furin and HIV-1 Nef, and PACS-1 has been shown to interact with the 
adaptor complexes AP-1 and AP-3 (EMBO J (2003) 22:6234-44; EMBO J (2001) 
25 20:2191-201). Furthermore, PACS-1 polypeptides have been shown to interact with 
Nef and through this interaction, by a PBK-dependent proces, MHC class I 
molecules are downregulated by Nef (Cell (2002) 11:853-66). Accordingly, POSH 
may be involved in Nef-mediated downregulation of MHC class I molecules in a 
cell infected with HIV-1. Additionally, PACS-1 interacts with the HIV-1 protein, 
30 Vpu. Vpu expresses an acidic amino acid sorting motif that is required for TGN 
localization through a retroviral process mediated by PACS-1 (Wan, L et al (1998) 
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Cell 94:205-216). Accordingly, in certain aspects, POSH may associate with Vpu 
through its interaction with PACS-1. 

The term PACS-1 is used herein to refer to various naturally occurring 
PACS-1 homologs, as well as functionally similar variants and fragments that retain 
5 at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring PACS-1 
(e.g., SEQ ID NOs: 362-366, 95-100). The term specifically includes human PACS- 
1 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with EPS8L2. Accordingly, the application provides 
10 complexes comprising POSH and EPS8L2. In one aspect, the application relates to 
the discovery that POSH binds directly with EPS8L2. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. EPS8L2 is an eps8-related 
protein. Eps8 forms a multimeric complex with Sos-1, Abil and PI3K that is 
required for Rac activation leading to actin remodelling. EPS8L2 has been shown to 
15 interact with Abil and Sos-1. EPS8L2 also has been shown to localize to PDGF- 
induced F-actin-rich ruffles and to restore receptor tyrosine kinase mediated actin 
remodeling when expressed in eps8-/- fibroblasts (Mol Biol Cell (2004) 15:91-8). 

The term EPS8L2 is used herein to refer to various naturally occurring 
EPS8L2 homologs, as well as functionally similar variants and fragments that retain 
20 at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring EPS8L2 
(e.g., SEQ ED NOs: 239, 58-60). The term specifically includes human EPS8L2 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

The application additionally relates to the discovery that a POSH polypeptide 
interacts with HIP55. Accordingly, the application provides complexes comprising 
25 POSH and HIP55. In one aspect, the application relates to the discovery that POSH 
binds directly with HIP55. This interaction was identified by Applicants in a yeast 
2-hybrid assay. HIP55 is a cytoplasmic adaptor protein that has been shown to bind 
to the cytoplasmic tail of the CD2v protein of African swine fever virus ( J Gen 
Virol (2004) 85:119-30). HIP55 (synonymous with mAbpl and SH3P7) comprises 
30 an SH3 domain and through its SH3 domain, associates with dynamin (J Cell Biol 
(2001) 153:351-66; Biochem Biophys Res Commun (2003) 301:704-10). 
Accordingly, in certain aspects, POSH may associate with dynamin through its 
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interaction with HIP55. HIP55 has also been shown to be important for receptor 
mediated endocytosis o f the transferrin receptor (Biochem Biophys Res C ommun 

(2003) 301:704-10). 

The term HIP55 is used herein to refer to various naturally occurring HIP55 
5 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring HIP55 (e.g., SEQ 
ID NOs: 390-394, 377-385). The term specifically includes human HIP55 nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
0 polypeptide interacts with CENTB1. Accordingly, the application provides 
complexes comprising POSH and CENTB1. In one aspect, the application relates to 
the discovery that POSH binds directly with CENTBL This interaction was 
identified by Applicants in a yeast 2-hybrid assay. CENTB1 is synonymous with 
ACAP1. ACAP1 is an ARE GTPase activating protein (ARF GAP). ACAP1 can 
15 function as a GAP for ARF1 and ARF6 (J Biol Chem (2002) 277:7962-9). 

The term CENTB1 is used herein to refer to various naturally occurring 
CENTB1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
CENTB1 (e.g., SEQ ID NOs: 231-232, 37-47). The term specifically includes 
20 human CENTB1 nucleic acid and amino acid sequences and the sequences presented 
in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with EIF3S3. Accordingly, the application provides 
complexes comprising POSH and EIF3S3. In one aspect, the application relates to 

25 the discovery that POSH binds directly with EIF3S3. This interaction was identified 
by Applicants in a yeast 2-hybrid assay. EIF3S3 is elevated in certain hepatocellular 
carcinomas and in prostate cancer (Hepatology (2003) 38:1242-9; Am J Pathol 
(2001) 159:2081-84). It has also been demonstrated that EIF3S3 is often amplified 
and o verexpressed i n b reast c ancer ( Genes C hromosomes C ancer. (2000) 2 8:203- 

30 210). 

The term EIF3S3 is used herein to refer to various naturally occurring 
EIF3S3 homologs, as well as functionally similar variants and fragments that retain 



9372369_1 

-49- 



Attorney DoeketNo.: PROL-PWO-024 

:; T/ u s o#/ o B 3 o s 



at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring EIF3S3 
(e.g., SEQ ID NOs: 238, 55-57). The term specifically includes human EIF3S3 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
> polypeptide interacts with SRA1 . Accordingly, the application provides complexes 
comprising POSH and SRA1. In one aspect, the application relates to the discovery 
that POSH binds directly with SRA1. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. SRA1 is a transcriptional coactivator, steroid receptor 
RNA activator 1. SRA is selective for steroid hormone receptors and mediates 
0 transactivation via their amino-terminal activation function (Cell (1999) 97:17-27). 
The term SRA1 is used herein to refer to various naturally occurring SRA1 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SRA1 (e.g., SEQ 
ID NOs: 291-294, 175-182). The term specifically includes human SRA1 nucleic 
1 5 acid and amino acid sequences and the sequences presented in Figure 36. 

The application additionally relates to the discovery that a POSH polypeptide 
interacts with WASF1. Accordingly, the application provides complexes 
comprising POSH and WASF1. In one aspect, the application relates to the 
discovery that POSH binds directly with WASF1. This interaction was identified by 
20 Applicants in a yeast 2-hybrid assay. WASF1 is a member of the Wiskott-Aldrich 
syndrome protein (WASP) family of proteins. WASF-1 has been shown to regulate 
cortical actin filament reorganization in response to extracellular stimuli. WASF1 is 
synonymous with WAVE1 and is an actin regulatory protein. It has been shown that 
Ras and the adaptor protein Nek activate actin nucleation through WAVE1 (Nature 

25 (2002)418:790-3). 

The term WASF1 is used herein to refer to various naturally occurring 
WASF1 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring WASF1 
(e.g., SEQ ID NOs: 389, 375-376). The term specifically includes human WASF1 
30 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

The application additionally relates to the discovery that a POSH polypeptide 
interacts with SPG20. Accordingly, the application provides complexes comprising 
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POSH and SPG20. In one aspect, the application relates to the discovery that POSH 
binds directly with SPG20. This interaction was identified by Applicants in a yeast 
^ - 2-hybrid assay. SPG20 is synonymous with spartin, and mutation in the gene has 
been implicated in Troyer syndrome, an autosomal recessive complicated hereditary 
5 spastic paraplegia. Comparative sequence analysis has shown that spartin shares 
similarity with molecules involved in endosomal trafficking (Nat Genef (2002) 
31:347-8). 

The term SPG20 is used herein to refer to various naturally occurring SPG20 
homologs, as well as functionally similar variants and fragments that retain at least 

10 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SPG20 (e.g., 
SEQ ID NOs: 386-388, 367-374). The term specifically includes human SPG20 
nucleic acid and amino acid sequences and the sequences presented in the Figure 36. 

In further embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with HLA-A. Accordingly, the application provides 

15 complexes comprising POSH and HLA-A. In one aspect, the application relates to 
the discovery that POSH binds directly with HLA-A. This interaction was identified 
by Applicants in a yeast 2-hybrid assay. In additional aspects, the application relates 
to the discovery that a POSH polypeptide interacts with HLA-B. Accordingly, the 
application provides complexes comprising POSH and HLA-B. In one aspect, the 

20 application relates to the discovery that POSH binds directly with HLA-B. This 
interaction was identified by Applicants in a yeast 2-hybrid assay. HLA-A and 
HLA-B are MHC class I molecules. HLA-A and HLA-B molecules are 
downregulated in the progression of AIDS, and this downregulation is associated 
with the activity of HIV- 1 Nef. 

25 The term HLA-A is used herein to refer to various naturally occurring HLA- 

A homologs, as well as functionally similar variants and fragments that retain at 
least 8 0%, 9 0%, 9 5%, o r 9 9% se quence i dentity t o a n aturally o ccurring H LA-A 
(e.g., SEQ ID NOs: 253, 87-88). The term specifically includes human HLA-A 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

30 The term HLA-B is used herein to refer to various naturally occurring HLA- 

B homologs, as well as functionally similar variants and fragments that retain at 
least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring HLA-B 
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(e.g., SEQ ID NOs: 254, 89). The term specifically includes human HLA-B nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with a ubiquitin-conjugating enzyme (E2). An exemplary E2 
5 may include, but are not limited to, UBC5a, UBC5c, UBC6, and UBC13. UBC13 is 
often found in a heterodimer complex with a Ub conjugating enzymer variant (UEV) 
protein, such as, for example, UEV la. (See Hofmann and Pickart, Noncanonical 
MMS2-Encoded Ubiquitin-Conjugating Enzyme Functions in Assembly of Novel 
Ubiquitin Chains for DNA Repair, Cell 96: 645-653 (1999), McKenna et al., 2002, 
10 Energetics and Specificity of Interactions within Ub-Uev-Ubcl3 Human Ubiquitin 
Conjugating Complexs, Biochemistry. Vol. 42. pp.7922-7930, and Ulrich, 2003, 
Protein-Protein Interactions within an E2-RING Finger Complex, The Jurnal of 
Biological Chemistry, Vol. 278. No 9. pp. 7051-7058). UVE proteins share 
significant sequence and structural similarities with E2s, yet lack the requisite active 
15 site cystine of the classical E2 protein family. 

Generally, UBC5 conjugates ubiquitin to Lysine 48 in a target protein, a 
signal that marks the protein for degradation by the 26 S proteosome. In constrast, 
UBC13/UEVla conjugates ubiquitin to Lysine 63 residue in a target protein, which 
is not a degradation signal. Instead, ubiquitin conjugated aj Lysine 63 has been 
20 implicated in diverse biological processes, including, for example, DNA damage 
repair, endocytosis, ribosome biogenesis, mitochondrial inheritance, and NFkB 
signaling (See Ulrich, 2003). The UBC13/UEVla has been shown to work with two 
other RING-ubiquitin ligases, TRAF6 and RAD5. (See Ulrich, 2003). TRAF6- 
UBC13-UEVla complex ubiquitinates TRAF6 (self-ubiquitination), thus enabling it 
25 to activate a kinase cascade. 

Without being bound to theory, it appears that UBC5a, UBC5c and UBC6 
may work with POSH in one pathway, while UBC13/UEVla work with POSH in 
another distinct pathway. This is supported by the fact that UBC5/6 marks POSH 
for degradation by conjugating ubiquitin at Lysine 48, whereas UBC13/UEVla 
30 marks POSH for purposes other than degradation by conjugating ubiquitin at Lysine 
63. This theory is further supported by the factthatUBC5a,UBC5candUBC6 
share high sequence similarities. 
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Accordingly, in certain aspects, the present application relates to an isolated, 
purified or recombinant complex comprising a POSH polypeptide and a UBC13. In 
certain aspects, the present application relates to an isolated, purified or recombinant 
complex comprising: a polypeptide comprising a domain that is at least 90% 

5 identical to a POSH RING domain, and a POSH-AP comprising an E2. An 
exemplary POSH associated protein E2 include, for example, is UBC13. UBC13 
may be in a h eterodimer c omplex with a Ub conjugating enzymer variant (UEV) 
protein, such as, for example, UEV la. 

The term "UBC13" and is used herein to refer to full-length UBC 13, any 

10 splice variants thereof, various naturally occurring UBC13 homologs, as well as 
functionally similar variants and fragments that retain at least 80%, 90%, 95%, or 
99% sequence identity to a naturally occurring UBC13 (e.g., SEQ ID NOs: 313, 
208-210). The term specifically includes UBC13 nucleic acid and amino acid 
sequences and the sequences presented in Figure 36. 

15 In certain embodiments, the application relates to the interaction between an 

ARF5 polypeptide and a POSH polypeptide. ARF5 is a member of the ARF gene 
family. The ARF proteins stimulate the in vitro ADP-ribosyltransferase activity of 
cholera toxin. ARF proteins play a role in vesicular trafficking in vivo. ARFs are 
members of the Ras GTPase superfamily. ARFs activate specific PLDs. 

20 Mammalian ARFs are divided into three classes based on size, amino acid sequence, 
gene structure, and phylogenetic analysis. ARF1 is in class I, and ARF5 is in class 
II. 

In certain embodiments, the application relates to the interaction between an 
ARF1 polypeptide and a POSH polypeptide. ARF1 is a small G protein involved in 

25 vesicular trafficking. The assembly/disassembly cycle of the coat protein I (COPI) 
on Golgi membranes is coupled to the GTP/GDP cycle of ARF1 (Nature (2003) 
426:563-6). ARF1 has been implicated in mitotic Golgi disassembly, chromosome 
segregation, and cytokinesis (Proc Natl Acad Sci (2003) 100:13314-9). ARF1 .has 
been shown to bind to the 5-HT2A receptor, a G protein coupled receptor (GPCR) 

30 (Mol Pharmacol (2003) 64:1239-50). 

The term ARF-1 is used herein to refer to various naturally occurring ARF-1 
homologs, as well as functionally similar variants and fragments that retain at least 
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80%, 90%, 95%, or 99% sequence identity to a naturally occurring ARF-1 (e.g. SEQ 
ID NOs: 223, 325-339). The term specifically includes human ARF-1 nucleic acid 
and amino acid sequences and the sequences presented in Figure 36. 

The term ARF-5 is used herein to refer to various naturally occurring ARF-5 
5 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring ARF-5 (e.g., 
SEQ ID NOs: 224, 340-344). The term specifically includes human ARF-5 nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the inhibition of viral 
10 maturation by modulation of an activity associated with a dynamin II polypeptide. 
Dynamin II is a large GTP-binding protein that is involved in endocytosis and in 
vesicle formation at the trans-Golgi network. Dynamin H contains a pleckstrin 
homology domain (PHD) and a proline-rich domain (PRD). Dynamin H plays an 
important role in vesicle formation at the plasma membrane, trans-Golgi network, 
15 and various other intracellular organelles. Accordingly, disrupting the activity of a 
dynamin II polypeptide or the interaction between a POSH polypeptide and a 
dynamin II polypeptide (e.g., by reducing POSH protein levels or alternatively, 
reducing dynamin H protein levels, through RNAi) may disrupt the activity of 
dynamin U in the secretory pathway and prevent the secretion of viral proteins, such 
20 as, for example, HBV proteins. Dynamin II participates in the transport and 
secretion of HBV proteins (Abdulkarim, AS et al (2003) J. Hepat. 38:76-83). 
Accordingly, in certain embodiments, inhibition of POSH adversely effects the 
' transport and release of HBV proteins. 

In certain embodiments, the application relates to the inhibition of dynamin 
25 activity, in particular the inhibition of the activity of dynamin II, a member of the 
dynamin family of proteins. In certain embodiments, the application relates to 
inhibition of dynamin H activity, which inhibition disrupts the transport and 
secretion of HBV proteins. The term dynamin II is used herein to refer to full- 
length, human dynamin II as well as various naturally occurring dynamin H 
30 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring dynamin II (e.g., 
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public gi number: 1196422, public gi number: 1706539, public gi number: 1 196423, 
and public gi number: 1363934). 

In certain embodiments, the application, relates to the inhibition of viral 
maturation by modulation of an activity associated with a Vpu polypeptide. Vpu is 

5 an HIV-1 encoded ion channel, which, among other tasks in the HIV-1 life cycle, is 
necessary for efficient virus budding (Schubert, U et al (1995) J. Virol. 69:7699- 
7711). Vpu may function at the trans Golgi network (TGN). Vpu expresses an 
acidic amino acid sorting motif that is required for TGN localization through a 
retroviral process mediated by the POSH-AP, P ACS-1 (Wan, L et al ( 1998) Cell 

10 94:205-216). Moreover, the phenotype conferred by human POSH knockdown is 
similar to that observed in cells expressing HIV-1 lacking Vpu where viruses also 
accumulate in intracellular membranes (Klimkait, T et al (1990) J. Virol. 64:621- 
629). 

Vpu regulates virus release from a post-endoplasmic reticulum compartment, 
1 5 such as possibly the TGN, by an ion channel activity mediated by its transmembrane 
anchor. Vpu also induces the selective down regulation of host cell receptor 
proteins such as CD4 and major histocompatibility complex class I molecules, in a 
process involving its cytoplasmic tail. Furthermore, Vpu-mediated degradation of 
CD4 is dependent on an intact ubiquitin-conjugating system. (See Schubert, U et al 
20 (1998) J. Virol. 72:2280-8). In certain embodiments of the present invention, Vpu- 
mediated degradation of a protein such as CD4 may involve a ubiquitin-conjugating 
system that includes a POSH polypeptide or a POSH-AP, such as, for example, Cbl- 
b. 

Vpu nucleic acid and the corresponding amino acid sequence encoded 
25 thereby are exemplified by the Vpu discussed in Strebel, K et al (1988) 241:1221- 
1223. The term Vpu is used herein to refer as well to Vpu of other HIV-1 isolates, 
such as the Vpu disclosed in GenBank, accession number U51190, and the Vpu 
disclosed in GenBank, accession number U52953. The term Vpu is used herein to 
refer as well to various naturally occurring Vpu homologs, as well as functionally 
30 similar variants and fragments that retain at least 80%, 90%, 95%, or 99% sequence 
identity to a naturally occurring Vpu. 

4. Methods and Compositions for Treating P OSH-associated Diseases 
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In certain aspects, the application provides methods and compositions for 
treatment of POSH-associated diseases (disorders), including cancer and viral 
disorders, as well as disorders associated with unwanted apoptosis, including, for 
example a variety of neurodegenerative disorders, such as Alzheimer's disease. 
5 In certain embodiments, the application relates to viral disorders (e.g., viral 

infections), and particularly disorders caused by retroid viruses, RNA viruses and/or 
envelope viruses. In view of the teachings herein, one of skill in the art will 
understand that the methods and compositions of the application are applicable to a 
wide range of viruses such as, for example, retroid viruses, RNA viruses, and 
1 0 envelope viruses. In a preferred embodiment, the present application is applicable to 
retroid viruses. In a more preferred embodiment, the present application is further 
applicable to retroviruses (retroviridae). In another more preferred embodiment, the 
present application is applicable to lentivirus, including primate lentivirus group. In 
a most preferred embodiment, the present application is applicable to Human 
1 5 Immunodeficiency virus (HTV), Human Immunodeficiency virus type- 1 (HIV-1), 
Hepatitis B Virus (HBV) and Human T-cell Leukemia Virus (HTLV). 

While not intended to be limiting, relevant retroviruses include: C-type 
retrovirus which causes lymphosarcoma in Northern Pike, the C-type retrovirus 
which infects mink, the caprine lentivirus which infects sheep, the Equine Infectious 
20 Anemia Virus (EIAV), the C-type retrovirus which infects pigs, the Avian Leukosis 
Sarcoma Virus (ALSV), the Feline Leukemia Virus (FeLV), the Feline Aids Virus, 
the Bovine Leukemia Virus (BLV), Moloney Murine Leukemia Virus (MMuLV), 
the Simian Leukemia Virus (SLV), the Simian hnmuno-deficiency Virus (SIV), the 
Human T-cell Leukemia Virus type-I (HTLV-I), the Human T-cell Leukemia Virus 
25 type-II (HTLV-H), Human Immunodeficiency virus type-2 (HIV-2) and Human 
Immunodeficiency virus type- 1 (HIV-1). 

The method and compositions of the present application are further 
applicable to RNA viruses, including ssRNA negative-strand viruses and ssRNA 
positive-strand viruses. The ssRNA positive-strand viruses include Hepatitis C 
30 Virus (HCV). In a preferred embodiment, the present application is applicable to 
mononegavirales, including filoviruses. Filoviruses further include Ebola viruses 
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and. Marburg viruses. In another preferred embodiment, the present invention is 
applicable to flaviviruses, including West Nile Virus (WNV). 

Other RNA viruses include picornaviruses such as enterovirus, poliovirus, 
coxsackievirus and hepatitis A virus, the caliciviruses, including Norwalk-like 
5 viruses, the rhabdoviruses, including rabies virus, the togaviruses including 

alphaviruses, Semliki Forest virus, denguevirus, yellow fever virus and rubella: virus, 
the orthomyxoviruses, including Type A, B, and C influenza viruses, the 
bunyaviruses, including the Rift Valley fever virus and the hantavirus, the 
filoviruses such as Ebola virus and Marburg virus, and the paramyxoviruses, 
1 0 including mumps virus and measles virus. Additional viruses that may be treated 
include herpes viruses. 

The methods and compositions of the present application are further 
applicable to hepatotrophic viruses, including HAV, HBV, HCV, HDV, and HEV. 
In certain aspects, the application relates to a method of inhibiting a hepatotrophic 
1 5 virus, comprising administering a POSH inhibitor to a subject in need thereof. In 
further aspects, the application relates to a method of treating a viral hepatitis 
infection, comprising administering a POSH inhibitor to a subject in need thereof. 
A viral hepatitis infection may be caused by a hepatotrophic virus, such as HAV, 
HBV, HCV, HDV, or HEV. In certain embodiments, the application relates to a 
20 method of treating an HBV infection by administering a POSH inhibitor to a subject 
in need thereof. 

In other embodiments, the application relates to methods of treating or 
preventing cancer diseases. The terms "cancer," "tumor," and "neoplasia" are used 
interchangeably herein. As used herein, a cancer (tumor or neoplasia) is 

25 characterized by one or more of the following properties: cell growth is not 

regulated by the normal biochemical and physical influences in the environment; 
anaplasia (e.g., lack of normal coordinated cell differentiation); and in some 
instances, metastasis. Cancer diseases include, for example, anal carcinoma, bladder 
carcinoma, breast carcinoma, cervix carcinoma, chronic lymphocytic leukemia, 

30 chronic myelogenous leukemia, endometrial carcinoma, hairy cell leukemia, head 
and neck carcinoma, lung (small cell) carcinoma, multiple myeloma, non-Hodgkin's 
lymphoma, follicular lymphoma, ovarian carcinoma, brain tumors, colorectal 
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carcinoma, hepatocellular carcinoma, Kaposi's sarcoma, lung (non-small cell 
carcinoma), melanoma, pancreatic carcinoma, prostate carcinoma, renal cell 
carcinoma, and soft tissue sarcoma. Additional cancer disorders can be found in, for 
example, Isselbacher et al. (1994) Harrison's Principles of Internal Medicine 1814- 
5 1 877, herein incorporated by reference. 

In a specific embodiment, anticancer therapeutics of the application are used 
in treating a POSH-associated cancer. As described herein, POSH-associated 
cancers include, but are not limited to, the thyroid carcinoma, liver cancer 
(hepatocellular cancer), lung cancer, cervical cancer, ovarian cancer, renal cell 
1 0 carcinoma, lymphoma, osteosacoma, liposarcoma, leukemia, breast carcinoma, and 
breast adeno-carcinoma. 

Preferred antiviral and anticancer therapeutics of the application can function 
by disrupting the biological activity of a POSH polypeptide or POSH complex in 
viral maturation. Certain therapeutics of the application function by disrupting the 
1 5 activity of a POSH-AP (e.g., HERPUD 1 ) in viral maturation. Certain therapeutics 
of the application function by disrupting the activity of POSH by inhibiting the 
ubiquitin ligase activity of a POSH polypeptide. In certain embodiments of the 
application, a therapeutic of the application inhibits the ubiquitination of a POSH- 
AP, such as for example the ubiquitination of HERPUD1. 
20 In other embodiments, the application relates to methods of treating or 

preventing neurological disorders. In one aspect, the invention provides methods 
and compositions for the identification of compositions that interfere with the 
function of a POSH or a POSH-AP, which function may relate to aberrant protein 
processing associated with a neurodegenerative disorder, such as for example, the 
25 processing of amyloid beta precursor protein associated with Alzheimer' s disease. 
Neurological disorders include disorders associated with increased levels of amyloid 
P production, such as for example, Alzheimer's disease. Neurological disorders also 
include Parkinson's disease, Huntington's disease, schizophrenia, Niemann-Pick's 
disease, and prion-associated diseases 

30 Exemplary therapeutics of the application include nucleic acid therapies such 

as, for example, RNAi constructs (small inhibitory RNAs), antisense 
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oligonucleotides, ribozyme, and DNA enzymes. Other therapeutics include 
polypeptides, peptidomimetics, antibodies and small molecules. 

Antisense therapies of the application include methods of introducing 
antisense nucleic acids to disrupt the expression of POSH polypeptides or proteins 
5 that are necessary for POSH function. 

RNAi therapies include methods of introducing RNAi constructs to 
downregulate the expression of POSH polypeptides or POSH-APs (e.g., 
HERPUD1). In certain embodiments, RNAi therapeutics are delivered to the liver 
(e.g., to hepatocytes). Exemplary RNAi therapeutics include any one of SEQ ID 
10 NOs: 15, 16, 18, 19, 21, 22, 24 and 25. 

Therapeutic polypeptides may be generated by designing polypeptides to 
mimic certain protein domains important in the formation of POSH: POSH-AP 
complexes, such as, for example, SH3 or RING domains. For example, a 
polypeptide comprising a POSH SH3 domain such as, for example, the SH3 domain 
15 as set forth in SEQ ED NO: 30 will compete for binding to a POSH SH3 domain and 
will therefore act to disrupt binding of a partner protein. In one embodiment, a 
binding partner may be a Gag polypeptide. In another embodiment, a binding 
partner may be Rac. In a further embodiment, a polypeptide that resembles an L 
domain may disrupt recruitment of Gag to the POSH complex. 
20 In view of the specification, methods for generating antibodies directed to 

epitopes of POSH and POSH-APs are known in the art. Antibodies may be 
introduced into cells by a variety of methods. One exemplary method comprises 
generating a nucleic acid encoding a single chain antibody that is capable of 
disrupting a POSHPOSH-AP complex. Such a nucleic acid may be conjugated to 
25 antibody that binds to receptors on the surface of target cells. It is contemplated that 
in certain embodiments, the antibody may target viral proteins that are present on the 
surface of infected cells, and in this way deliver the nucleic acid only to infected 
cells. Once bound to the target cell surface, the antibody is taken up by endocytosis, 
and the conjugated nucleic acid is transcribed and translated to produce a single 
30 chain antibody that interacts with and disrupts the targeted POSH:POSH-AP 

complex. Nucleic acids expressing the desired single chain antibody may also be 
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introduced into cells using a variety of more conventional techniques, such as viral 
transfection (e.g., using an adenoviral system) or liposome-mediated transfection. 

- ' .Small molecules of the application may be identified for their ability to 
modulate the formation of POSH:POSH-AP complexes. 

Certain embodiments of the disclosure relate to use of a small molecule as an 
inhibitor of POSH. Examples of such small molecules include the following 

compounds: 

Compound CAS 27430-18-8: 




O 



10 



Compound CAS 1631-29-4: 
O 




Compound CAS 503065-65-4: 



15 




Compound CAS 414908-08: 
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10 



Compound CAS 415703-60-5: 
Br 




Compound CAS 77367-94-3: 
O. 




Compound CAS 154184-27-7 




In certain embodiments, compounds useful in the instant compositions and 
methods include heteroarylmethylene-dihydro-2,4,6-pyrimidinetriones and their 
thione analogs. Preferred heteroaryl moieties include 5-membered rings such as 
thienyl, furyl, pyrrolyl, oxazolyl, thiazolyl, and imidazolyl moieties. 
15 In certain embodiments, compounds useful in the instant compositions and 

methods include N-arylmaleimides, especially N-phenylmaleimides, in which the 
phenyl group may be substituted or unsubstituted. 

In certain embodiments, compounds useful in the instant compositions and 
methods include arylallylidene-2,4-imidazolidinediones and their thione analogs. 
9372369 1 
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Preferred aryl groups are phenyl groups, and both the aryl and allylidene portions of 
the molecule may be substituted or unsubstituted. 

In certain embodiments, compounds useful in the instant compositions and ■ 
methods include substituted distyryl compounds and aza analogs thereof such as 
5 substituted 1 ,4-diphenylazabutadiene compounds. 

In certain other embodiments, compounds useful in the instant compositions 
and methods include substituted styrenes and aza analogs thereof, such as 1,2- 
diphenylazaethylenes and l-phenyl-2-pyridyl-azaethelenes. 

In yet other embodiments, compounds useful in the instant compositions and 
10 methods include N-aryl-N'-acylpiperazines. In such compounds, the aryl ring, the 
acyl substituent, and/or the piperazine ring may be substituted or unsubstituted. 

In additional embodiments, compounds useful in the instant compositions 
and methods include aryl esters of (2-oxo-benzooxazol-3-yl)-acetic acid, and 
analogs thereof in which one or more oxygen atoms are replaced by sulfur atoms. 
15 In certain embodiments, the present application contemplates use of known 

PKA modulators (e.g., inhibitors or activators) in the methods of ihibiting viral 
infection and in the methods of treating or preventing cancer. Such PKA modulators 
include any compound, peptide, nucleotide derivative, nucleoside derivative, 
polysaccharide, sugar or other substance that can inhibit the activity of protein 
20 kinase A. Many PKA inhibitors are available and may be used. For example, many 
examples of PKA inhibitors including chemical structures, methods for 
administration and pharmacological effects are listed at the Calbiochem website at 
calbiochem.com. In general, inhibitors that also significantly inhibit protein kinase C 
activity are avoided. 

25 In some embodiments, the PKA inhibitor is a nucleotide or nucleoside 

derivative. Specific examples of nucleoside or nucleotide derivatives that act as 
PKA inhibitors and that can be utilized in the disclosure include adenosine 3',5' 
cyclic monophosphorothioate. The H-89 inhibitor is a potent PKA inhibitor that can 
be used in the disclosure. The chemical name for the H-89 inhibitor is N-[2- 

30 ((Pbromocinnamyl) amino)ethyl] isoquinolinesulfonamide. The KT5720 inhibitor 
from Calbiochem can also be used in the disclosure. Other PKA inhibitors which are 
available at from Calbiochem and can be used in the disclosure include ellagic acid 
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(also named 4,4\5,5\6,6 , -hexahydroxydiphenic acid 2,6,2*,6*-ditactone), piceatannol, 
l-(5-Isoquinolinesulfonyl) methylpiperazine (H-7), N-[2-(methylamino)ethyl] 

' isoquinolinesulfonamide (H-8), N-(2-aminoethyl) isoquinolinesulfonamide (H-9), 
and (5-isoquinolinesulfonyl)piperazine, 2HCI (H-100). 

5 The PKA inhibitor can also be a peptide inhibitor (PKI). Such a peptide 

inhibitor c an b e a ny p eptide t hat i s r ecognized and b ound b y P KA b ut that P KA 
cannot phosphorylate. An example of a peptide inhibitor is a peptide with a 
"consensus sequence" for PKA recognition but with alanine in place of serine, for 
example, a peptide with the following sequence: Xaa-Arg-Arg-Xaa-Ala-Xaa, 

10 wherein Xaa is any amino acid, which specifically binds to the pseudoregion of the 
regulatory domain of PKA. Myristoylated PKA inhibitor amide (14-22, Cell- 
Permeable) having the sequence Myr-N-Gly-Arg-Thr-Gly-Arg-Arg-Asn-Ala-Ile- 
NH 2 is another example of a peptide inhibitor that can be utilized in the disclosure. 
A variety of other PKI peptides can be used as an inhibitor of protein kinase A in the 

15 practice of the disclosure. For example, several PKI peptides can be found in the 
NCBI protein database. See website at ncbi.nlm.nih.gov/ 
Genbank/GenbankOverview. One example of a human PKI peptide can be-found at 
Genbank Accession No. P04541 (gi: 417194). Another example of a human PKI 
peptide is at Genbank Accession No. NP 008997 (gi: 5902020). Another PKI that 

20 can be used as an inhibitor has the following sequence: 
Ile-Ala-Ser-Gly-Arg-Thr-Gly-Arg-Arg-Asn-Ala-Ile-His-Asp-lle-Leu-Val-SerSer- 

Ala. See published PCT application WO 03/080649. 

Further examples of protein kinase A inhibitors are provided in the following 

references: Muniz et al., Proceedings of the National Academy of Sciences USA 
25 1997 Dec 23; 94(26) 14461-66; Baude et al., Journal of Biological Chemistry Vol. 

269 issue 27 18128-18133 (Jul. 1994); Scott et al. 

Applicants found that POSH is phosphorylated by PKA and phosphorylation 

of POSH by PKA can inhibit POSH function, for example dissociating POSH from 

POSH interacting proteins (e.g, Rac). Therefore, in certain embodiments, the present 
30 disclosure also cotemplates use of PKA activators in treating or preventing a POSH- 

associated disease (e.g., viral infection or cancer). Exemplary PKA activators 

include, but are not limited to, forskolin, 8-Br-cAMP, and rolipram. 
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In additional embodiments of the application, compounds useful in the 
present application include phosphatase inhibitors. Phosphatase inhibitors useful in 
the subject application include sodium phosphate, sodium vanadate, and okadaic 
acid. In certain embodiments, the present application contemplates use of known 

5 phosphatase inhibitors in the methods of inhibiting viral infection, in the methods of 
treating or preventing cancer, and in the methods of inhibiting the progression of a 
neurodenerative disorder. Phosphatase inhibitors may be useful in inhibiting the 
activity of a POSH-AP, such as for example, PTPN12. 

For POSH-APs that are GTPases, inhibitors such as GTPgamma35S would 

10 be effective at inhibiting the GTPase activity of the POSH-AP. For example, 
inhibition of ARF1 or ARF5 could be accomplished with the use of a GTPase 
inhibitor such as GTPgamma35S, a non-hydrolyzable form of GTP. 

The generation o f n ucleic a cid b ased t herapeutic a gents d irected t o P OSH 
and POSH-APs is described below. 

1 5 Methods for identifying and evaluating further modulators of POSH and 

POSH-APs are also provided below. 

5. RNA Interference^ Ribozvmes, Antisense and Related Constructs 

In certain aspects, the application relates to RNAi, ribozyme, antisense and 
20 other nucleic acid-related methods and compositions for manipulating (typically 
decreasing) a POSH activity. Exemplary RNAi and ribozyme molecules may 
comprise a sequence as shown in any of SEQ ED Nos: 15, 16, 18, 19, 21, 22, 24 and 
25. 

In certain aspects, the application relates to RNAi, ribozyme, antisense and 
25 other nucleic acid-related methods and compositions for manipulating (typically 
decreasing) a POSH-AP activity. Specific instances of nucleic acids that may be 
used to design nucleic acids for RNAi, ribozyme, antisense are provided in Figure 
36. Additionally, nucleic acids of POSH-APs listed in Table 8 may be used to 
design nucleic acids for RNAi, ribozyme, antisense. 
30 Certain embodiments of the application make use of materials and methods 

for e ffecting k nockdown o f o ne o r m ore P OSH or P OSH- AP genes b y means o f 
RNA interference (RNAi). RNAi is a process of sequence-specific post- 
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transcriptional gene repression which can occur in eukaryotic cells. In general, this 
process involves degradation of an mRNA of a particular sequence induced by 
double-stranded RNA (dsRNA) that is homologous to that sequence. For example, 
the expression of a long dsRNA corresponding to the sequence of a particular single- 
5 stranded mRNA (ss mRNA) will labilize that message, thereby "interfering" with 
expression of the corresponding gene. Accordingly, any selected gene may be 
repressed by introducing a dsRNA which corresponds to all or a substantial part of 
the m RNA for t hat gene. It a ppears t hat w hen a 1 ong d sRNA i s expressed, i t i s 
initially processed by a ribonuclease III into shorter dsRNA oligonucleotides of as 
10 few as 21 to 22 base pairs in length. Furthermore, Accordingly, RNAi may be 
effected by introduction or expression of relatively short homologous dsRNAs. 
Indeed the use of relatively short homologous dsRNAs may have certain advantages 
as discussed below. 

Mammalian cells have at least two pathways that are affected by double- 
15 stranded RNA ( dsRNA). In the RNAi (sequence-specific) pathway, the initiating 
dsRNA is first broken into short interfering (si) RNAs, as described above. T he 
siRNAs have sense and antisense strands of about 21 nucleotides that form 
approximately 19 nucleotide si RNAs with overhangs of two nucleotides at each 3' 
end. Short interfering RNAs are thought to provide the sequence information that 
20 allows a specific messenger RNA to be targeted for degradation. In contrast, the 
nonspecific pathway is triggered by dsRNA of any sequence, as long as it is at least 
about 30 base pairs in length. The nonspecific effects occur because dsRNA 
activates two enzymes: PKR, which in its active form phosphorylates the translation 
initiation factor eEF2 to shut down all protein synthesis, and 2\ 5' oligoadenylate 
25 synthetase (2', 5 '-AS), which synthesizes a molecule that activates Rnase L, a 
nonspecific enzyme that targets all mRNAs. The nonspecific pathway may 
represent a host response to stress or viral infection, and, in general, the effects of 
the nonspecific pathway are preferably minimized under preferred methods of the 
present application. Significantly, longer dsRNAs appear to be required to induce 
30 the nonspecific pathway and, accordingly, dsRNAs shorter than about 30 bases pairs 
are preferred to effect gene repression by RNAi (see Hunter et al. (1975) J Biol 
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Chem 250: 409-17; Manche et al. (1992) Mol Cell Biol 12: 5239-48; Minks et al. 
(1979) J Biol Chem 254: 10180-3; and Elbashir et al. (2001) Nature 411: 494-8). 
. " _jRNAi has been shown to be effective in reducing or ehminating the • 
expression of genes in a number of different organisms including Caenorhabditiis 
5 elegans (see e.g., Fire et al. (1998) Nature 391: 806-11), mouse eggs and embryos 
(Wianny et al. (2000) Nature Cell Biol 2: 70-5; Svoboda et al. (2000) Development 
127: 4147-56), and cultured RAT-1 fibroblasts (Bahramina et al. (1999) Mol Cell 
Biol 19: 274-83), and appears to be an anciently evolved pathway available in 
eukaryotic plants and animals (Sharp (2001) Genes Dev. 15: 485-90). RNAi has 
10 proven to be an effective means of decreasing gene expression in a variety of cell 
types including HeLa cells, NIH/3T3 cells, COS cells, 293 cells and BHK-21 cells, 
and typically decreases expression of a gene to lower levels than that achieved using 
antisense techniques and, indeed, frequently eliminates expression entirely (see Bass 
(2001) Nature 411: 428-9). In mammalian cells, siRNAs are effective at 
15 concentrations that are several orders of magnitude below the concentrations 
typically used in antisense experiments (Elbashir et al. (2001) Nature 41 1 : 494-8). 

The double stranded oligonucleotides used to effect RNAi are preferably less 
than 30 base pairs in length and, more preferably, comprise about 25, 24, 23, 22, 21, 
20, 19, 18 or 17 base pairs of ribonucleic acid. Optionally the dsRNA 
20 oligonucleotides o f t he application m ay i nclude 3 ' o verhang ends. E xemplary 2 - 
nucleotide 3' overhangs maybe composed of ribonucleotide residues of any type 
and may even be composed of 2'-deoxythymidine resides, which lowers the cost of 
RNA synthesis and may enhance nuclease resistance of siRNAs in the cell culture 
medium and within transfected cells (see Elbashir et al. (2001) Nature 411: 494-8). 
25 Longer dsRNAs of 50, 75, 100 or even 500 base pairs or more may also be utilized 
in certain embodiments of the application. Exemplary concentrations of dsRNAs for 
effecting RNAi are about 0.05 nM, 0.1 nM, 0.5 nM, 1.0 nM, 1.5 nM, 25 nM or 100 
nM, although other concentrations may be utilized depending upon the nature of the 
cells treated, the gene target and other factors readily discemable the skilled artisan. 
30 Exemplary dsRNAs may be synthesized chemically or produced in vitro or in vivo 
using appropriate expression vectors. Exemplary synthetic RNAs include 21 
nucleotide RNAs chemically synthesized using methods known in the art (e.g., 
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Expedite RNA phophoramidites and thymidine phosphoramidite (Proligo, 
Germany). S ynthetic oligonucleotides are preferably deprotected and gel-purified 
using methods known in the art (see e.g., Elbashir et al. (2001) Genes Dev. 15: 188- 
200). Longer RNAs may be transcribed from promoters, such as T7 RNA 
5 polymerase promoters, known in the art. A single RNA target, placed in both 
possible orientations downstream of an in vitro promoter, will transcribe both 
strands of the target to create a dsRNA oligonucleotide of the desired target 
sequence. Any of the above RNA species will be designed to include a portion of 
nucleic acid sequence represented in a POSH or POSH-AP nucleic acid, such as, for 
10 example, a nucleic acid that hybridizes, under stringent and/or physiological 
conditions, to any of SEQ ID Nos: 1, 3, 4, 6, 8 and 10 and complements thereof or 
any of the POSH-AP sequences presented in Figure 36. 

The specific sequence utilized in design of the oligonucleotides may be any 
contiguous sequence of nucleotides contained within the expressed gene message of 
15 the target. Programs and algorithms, known in the art, may be used to select 
appropriate target sequences. In addition, optimal sequences may be selected 
utilizing programs designed to predict the secondary structure of a. specified single 
stranded nucleic acid sequence and allowing selection of those sequences likely to 
occur in exposed single stranded regions of a folded mRNA. Methods and 
20 compositions for designing appropriate oligonucleotides may be found, for example, 
in U.S. Patent Nos. 6,251,588, the contents of which are incorporated herein by 
reference. Messenger RNA (mRNA) is generally thought of as a linear molecule 
which contains the information for directing protein synthesis within the sequence of 
ribonucleotides, however studies have revealed a number of secondary and tertiary 
25 structures that exist in most mRNAs. Secondary structure elements in RNA are 
formed largely by Watson-Crick type interactions between different regions of the 
same RNA molecule. Important secondary structural elements include 
intramolecular d ouble s tranded r egions, h airpin 1 oops, b ulges i n d uplex RNA and 
internal loops. Tertiary structural elements are formed when secondary structural 
30 elements come in contact with each other or with single stranded regions to produce 
a more complex three dimensional structure. A number of researchers have 
measured the binding energies of a large number of RNA duplex structures and have 

9372369 1 

-67- 



Attorney Docket No.: PROL-PWO-024 

C T/' U SO "+/ 01 6 3 O 8 



derived a set of rules which can be used to predict the secondary structure of RNA 
(see e.g., Jaeger et al. (1989) Proc. Natl. Acad. Sci. USA 86:7706 (1989); and 
Turner et al. (1988) Annu. Rev. Biophys. Biophys. Chem. 17:167) . The rules are 
useful in identification of RNA structural elements and, in particular, for identifying 
5 single stranded RNA regions which may represent preferred segments of the mRNA 
to target for silencing RNAi, ribozyme or antisense technologies. Accordingly, 
preferred segments of the mRNA target can be identified for design of the RNAi 
mediating dsRNA oligonucleotides as well as for design of appropriate ribozyme 
and hammerheadribozyme compositions of the application. 
10 The dsRNA oligonucleotides may be introduced into the cell by transfection 

with an heterologous target gene using carrier compositions such as liposomes, 
which are known in the art- e.g., Lipofectamine 2000 (Life Technologies) as 
described by the manufacturer for adherent cell lines. Transfection of dsRNA 
oligonucleotides for targeting endogenous genes may be carried out using 
15 Oligofectamine (Life Technologies). Transfection efficiency may be checked using 
fluorescence microscopy for mammalian cell lines after co-transfection of hGFP- 
encoding pAD3 (Kehlenback et al. (1998) J Cell Biol 141: 863-74). The 
effectiveness of the RNAi may be assessed by any of a number of assays following 
introduction of the dsRNAs. These include Western blot analysis using antibodies 
20 which recognize the POSH or POSH-AP gene product following sufficient time for 
turnover o f t he e ndogenous p ool a fter n ew p rotein s ynthesis i s r epressed, r everse 
transcriptase polymerase chain reaction and Northern blot analysis to determine the 
level of existing POSH or POSH-AP target mRNA. 

Further compositions, methods and applications of RNAi technology are 
25 provided in U.S. Patent Application Nos. 6,278,039, 5,723,750 and 5,244,805, 
which are incorporated herein by reference. 

Ribozyme molecules designed to catalytically cleave POSH or POSH-AP 
mRNA transcripts can also be used to prevent translation of suject POSH or POSH- 
AP mRNAs and/or expression of POSH or POSH-APs (see, e.g., PCT International 
30 Publication WO90/11364, published October 4, 1990; Sarver et al. (1990) Science 
247:1222-1225 and U.S. Patent No. 5,093,246). Ribozymes are enzymatic RNA 
molecules capable of catalyzing the specific cleavage of RNA. (For a review, see 
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Rossi (1994) Current Biology 4: 469-471). The mechanism of ribozyme action 
involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 
composition of ribozyme molecules preferably includes one or more sequences 
> complementary to a POSH or POSH-AP mRNA, and the well known catalytic 
sequence responsible for mRNA cleavage or a functionally equivalent -sequence 
(see, e.g., U.S. Pat. No. 5,093,246, which is incorporated herein by reference in its 
entirety). 

While ribozymes that cleave mRNA at site specific recognition sequences 
0 can be used to destroy target mRNAs, the use of hammerhead ribozymes is 
preferred. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking 
regions that form complementary base pairs with the target mRNA. Preferably, the 
target mRNA has the following sequence of two bases: 5'-UG-3\ The construction 
and production of hammerhead ribozymes is well known in the art and is described 
5 more fully in Haseloff and Gerlach ((1988) Nature 334:585-591; and see PCT 
Appln. No. WO89/05852, the contents of which are incorporated herein by 
reference). H ammerhead r ibozyme s equences c an b e e mbedded i n a s table RNA 
such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo (Perriman et 
al. (1995) Proc. Natl. Acad. Sci. USA, 92: 6175-79; de Feyter, and Gaudron, 
20 Methods in Molecular Biology, Vol. 74, Chapter 43, "Expressing Ribozymes in 
Plants", E dited b y T urner, P . C , H umana P ress Inc., T otowa, N .J). In p articular, 
RNA polymerase Ill-mediated expression of tRNA fusion ribozymes are well 
known in the art ( see Kawasaki et al. (1998) Nature 393: 284-9; Kuwabara et al. 
(1998) Nature Biotechnol. 16: 961-5; and Kuwabara et al. (1998) Mol. Cell 2: 617- 
25 27; Koseki et al. (1999) J Virol 73: 1868-77; Kuwabara et al. (1999) Proc Natl Acad 
Sci USA 96: 1886-91; Tanabe et al. (2000) Nature 406: 473-4). There are typically 
a number of potential hammerhead ribozyme cleavage sites within a given target 
cDNA sequence. Preferably the ribozyme is engineered so that the cleavage 
recognition site is located near the 5' end of the target mRNA- to increase efficiency 
30 and minimize the intracellular accumulation of non-functional mRNA transcripts. 
Furthermore, the use of any cleavage recognition site located in the target sequence 
encoding different portions of the C-terminal amino acid domains of, for example, 

9372369_1 

-69- 



Attorney Docket No.: PROL-PWO-024 

C T/ U S O Mh-' O B 3 O 8 

long and short forms of target would allow the selective targeting of one or the other 
form of the target, and thus, have a selective effect on one form of the target gene 
product. 

Gene targeting ribozymes necessarily contain a hybridizing region 
5 complementary to two regions, each of at least 5 and preferably each 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 contiguous nucleotides in length of a POSH 
or POSH-AP mRNA, such as an mRNA of a sequence represented in any of SEQ ID 
Nos: 1, 3, 4, 6, 8 or 10 or a POSH-AP presented in Figure 36. In addition, 
ribozymes possess highly specific endoribonuclease activity, which autocatalytically 
0 cleaves the target sense mRNA. The present application extends to ribozymes 
which hybridize to a sense mRNA encoding a POSH gene such as a therapeutic drug 
target candidate gene, thereby hybridising to the sense mRNA and cleaving it, such 
that it is no longer capable of being translated to synthesize a functional polypeptide 
product. 

15 The ribozymes of the present application also include RNA 

endoribonucleases (hereinafter "Cech-type ribozymes") such as the one which 
occurs naturally in Tetrahymena thermophila (known as the IVS, or L-19 IVS RNA) 
and which has been extensively described by Thomas Cech and collaborators (Zaug, 
et al. (1984) Science 224:574-578; Zaug, et al. (1986) Science 231:470-475; Zaug, 

20 et al. (1986) Nature 324:429-433; published International patent application No. 
WO88/04300 by University Patents Inc.; Been, et al. (1986) Cell 47:207-216). The 
Cech-type ribozymes have an eight base pair active site which hybridizes to a target 
RNA sequence whereafter cleavage of the target RNA takes place. The application 
encompasses those Cech-type ribozymes which target eight base-pair active site 

25 sequences that are present in a target gene or nucleic acid sequence. 

Ribozymes can be composed of modified oligonucleotides (e.g., for 
improved stability, targeting, etc.) and should be delivered to cells which express the 
target gene in vivo. A preferred method of delivery involves using a DNA construct 
"encoding" the ribozyme under the control of a strong constitutive pol III or pol II 

30 promoter, so that transfected cells will produce sufficient quantities of the ribozyme 
to destroy endogenous target messages and inhibit translation. Because ribozymes, 
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unlike antisense molecules, are catalytic, a lower intracellular concentration is 

required for efficiency. 

In certain embodiments, a ribozyme may be designed by first identifying a 
sequence portion sufficient to cause effective knockdown by RNAi. The same 
sequence portion may then be incorporated into a ribozyme. In this aspect of the 
application, the gene-targeting portions of the ribozyme or RNAi are substantially 
the same sequence of at least 5 and preferably 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 
17, 18, 19 or 20 or more contiguous nucleotides of a POSH nucleic acid, such as a 
nucleic acid of any of SEQ ID Nos: 1, 3, 4, 6, 8, or 10 or POSH-AP nucleic acid, as 
presented in Figure 36. In a long target RNA chain, significant numbers of target 
sites are not accessible to the ribozyme because they are hidden within secondary or 
tertiary structures (Birikh et al. (1997) Eur J Biochem 245: 1-16). To overcome the 
problem of target RNA accessibility, computer generated predictions of secondary 
structure are typically used to identify targets that are most likely to be single- 
stranded or have an "open" configuration (see Jaeger et al. (1989) Methods Enzymol 
183: 281-306). Other approaches utilize a systematic approach to predicting 
secondary structure which involves assessing a huge number of candidate 
hybridizing oligonucleotides molecules (seeMilner et al. (1997) Nat Biotechnol 15: 
537-41; and Patzel and Sczakiel (1998) Nat Biotechnol 16: 64-8). Additionally, U.S. 
Patent No. 6,251,588, the contents of which are hereby incorporated herein, 
describes methods for evaluating oligonucleotide probe sequences so as to predict 
the potential for hybridization to a target nucleic acid sequence. The method of the 
application provides for the use of such methods to select preferred segments of a 
target mRNA sequence that are predicted to be single-stranded and, further, for the 
opportunistic utilization of the same or substantially identical target mRNA 
sequence, preferably comprising about 10-20 consecutive nucleotides of the target 
mRNA, in the design of both the RNAi oligonucleotides and ribozymes of the 
application. 

A further aspect of the application relates to the use of the isolated 
"antisense" nucleic acids to inhibit expression, e.g., by inhibiting transcription 
and/or translation of a POSH or POSH-AP nucleic acid. The antisense nucleic acids 
may bind to the potential drug target by conventional base pair complementarity, or, 
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for example, in the case of binding to DNA duplexes, through specific interactions 
in the major groove of the double helix. In general, these methods refer to the range 
of "techniques generally employed in the art, and include any methods that rely on 
specific binding to oligonucleotide sequences. 
5 An antisense construct of the present application can be delivered, for 

example, as an expression plasmid which, when transcribed in the cell, produces 
RNA which is complementary to at least a unique portion of the cellularmRNA 
which encodes a POSH or POSH-AP polypeptide. Alternatively, the antisense 
construct is an oligonucleotide probe, which is generated ex vivo and which, when 
10 introduced into the cell causes inhibition of expression by hybridizing with the 
mRNA and/or genomic sequences of a POSH or POSH-AP nucleic acid. Such 
oligonucleotide probes are preferably modified oligonucleotides, which are resistant 
to endogenous nucleases, e.g., exonucleases and/or endonucleases, and are therefore 
stable in vivo. Exemplary nucleic acid molecules for use as antisense 
15 oligonucleotides are phosphoramidate, phosphothioate and methylphosphonate 
analogs of DNA (see also U.S. Patents 5,176,996; 5,264,564; and 5,256,775). 
Additionally, general approaches to constructing oligomers useful in antisense 
therapy have been reviewed, for example, by Van der Krol et al. (1988) 
BioTechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 2668. 
20 With respect to antisense DNA, oligodeoxyribonucleotides derived from the 

translation initiation site, e.g., between the -10 and +10 regions of the target gene, 
are preferred. Antisense approaches involve the design of oligonucleotides (either 
DNA or RNA) that are complementary to mRNA encoding a POSH or POSH-AP 
polypeptide. The antisense oligonucleotides will bind to the mRNA transcripts and 
25 prevent translation. Absolute complementarity, although preferred, is not required. 
In the case of double-stranded antisense nucleic acids, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementarity and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the more 
30 base mismatches with an RNA it may contain and still form a stable duplex (or 
triplex, as the case may be). One skilled in the art can ascertain a tolerable degree of 
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mismatch by use of standard procedures to determine the melting point of the 

hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the mRNA, e.g., the 
5' untranslated sequence up to and including the AUG initiation codon, should work 
5 most efficiently at inhibiting translation. However, sequences complementary to the 
3' u translated s equences o f m RN As h ave r ecently b een s hown t o b e e ffective a t 
inhibiting translation of mRNAs as well. (Wagner, R. 1994. Nature 372:333). 
Therefore, oligonucleotides complementary to either the 5" or 3' untranslated, non- 
coding regions of a gene could be used in an antisense approach to inhibit translation 
1 0 of that mRNA. Oligonucleotides, complementary to the 5' untranslated region of the 
mRNA should include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mRNA coding regions are less efficient 
inhibitors of translation but could also be used in accordance with the application. 
Whether designed to hybridize to the 5', 3' or coding region of mRNA, antisense 
1 5 nucleic acids should be at least six nucleotides in length, and are preferably less that 
about 100 and more preferably less than about 50, 25, 17 or 10 nucleotides in length. 

It is preferred that in vitro studies are first performed to quantitate the ability 
of the antisense oligonucleotide to inhibit gene expression. It is preferred that these 
studies utilize controls that distinguish between antisense gene inhibition and 
20 nonspecific biological effects of oligonucleotides. It is also preferred that these 
studies compare levels of the target RNA or protein with that of an internal control 
RNA or protein. Results obtained using the antisense oligonucleotide may be 
compared with those obtained using a control oligonucleotide. It is preferred that 
the control oligonucleotide is of approximately the same length as the test 
25 oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from 
the antisense sequence no more than is necessary to prevent specific hybridization to 
the target sequence. 

The antisense oligonucleotides can be DNA or RNA or chimeric mixtures or 
derivatives or modified versions thereof, single-stranded or double-stranded. T he 
30 oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate 
backbone, for example, to improve stability of the molecule, hybridization, etc. The 
oligonucleotide may include other appended groups such as peptides (e.g., for 
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targeting host cell receptors), or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553- 
6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
W088/09810, published December 15, 1988) or the blood- brain barrier (see, e.g., 
5 PCT Publication No. W089/10134, published April 25, 1988), hybridization- 
triggered cleavage agents. (See, e.g., Krol et al., 1988, BioTechniques 6:958- 976) 
or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, 
the oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
1 0 cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base 
moiety which is selected from the group including but not limited to 5-fluorouracil, 
5- bromouracil, 5-chlorouraciI, 5-iodouracil, hypoxanthine, xantine, 4- 
acetylcytosine, 5- (carboxyhydroxytiethyl) uracil, 5-carboxymethylaminomethyl-2- 
15 thiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, beta-D- 
galactosylqueosine, inosine, N6- isopentenyladehine, 1-methylguanine, 1- 
methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 
methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-memoxyaminomethyl-2-thiouracil, beta-D- 
3 mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6- isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5- 
methyI-2-thiouracil, 3-(3-amino-3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 
> diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including but not limited to arabinose, 2- 
fluoroarabinose, xylulose, and hexose. 

The antisense oligonucleotide can also contain a neutral peptide-like 
backbone. Such molecules are termed peptide nucleic acid (PNA)-oIigomers and 
are described, e.g., in Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. U.S.A. 
93:14670 and in Eglom et al. (1993) Nature 365:566. One advantage of PNA 
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oligomers is their capability to bind to complementary DNA essentially 
independently from the ionic strength of the medium due to the neutral backbone of 
^ the DNA. In yet another embodiment, the antisense oligonucleotide comprises at 
least one modified phosphate backbone selected from the group consisting of a 
5 phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, aif alkyl 
phosphotriester, and a formacetal or analog thereof. 

In yet a further embodiment, the antisense oligonucleotide is an alpha- 
anomeric oligonucleotide. An alpha-anomeric oligonucleotide forms specific 
10 double-stranded hybrids with complementary RNA in which, contrary to the usual 
antiparallel orientation, the strands run parallel to each other (Gautier et al., 1987, 
Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2 , -0-methylribonucleotide 
(Inoue et al., 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 
15 While antisense nucleotides complementary to the coding region of a POSH 

or POSH-AP mRNA sequence can be used, those complementary to the transcribed 
untranslated region may also be used. 

In certain instances, it may be difficult to achieve intracellular concentrations 
of the antisense sufficient to suppress translation on endogenous mRNAs. Therefore 
20 a preferred approach utilizes a recombinant DNA construct in which the antisense 
oligonucleotide is placed under the control of a strong pol III or polll promoter. 
The use of such a construct to transfect target cells will result in the transcription of 
sufficient amounts of single stranded RNAs that will form complementary base pairs 
with the endogenous potential drug target transcripts and thereby prevent translation. 
25 For example, a vector can be introduced such that it is taken up by a cell and directs 
the transcription of an antisense RNA. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the 
desired antisense RNA. Such vectors can be constructed by recombinant DNA 
technology methods standard in the art. Vectors can be plasmid, viral, or others 
30 known in the art, used for replication and expression in mammalian cells. 
Expression o f the sequence encoding the antisense RNA can b e by any promoter 
known in the art to act in mammalian, preferably human cells. Such promoters can 
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be inducible or constitutive. Such promoters include but are not limited to: the 
SV40 early promoter region (Bemoist and Chambon, 1 981, Nature 2 90:304-310), 
the promoter contained in the 3* long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter 
5 (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory 
sequences of the metallothionein gene (Brinster et al, 1982, Nature 296:39-42), etc. 
Any type of plasmid, cosmid, YAC or viral vector can be used to prepare the 
recombinant DNA construct, which can be introduced directly into the tissue site. 

Alternatively, POSH or POSH-AP gene expression can be reduced by 
10 targeting deoxyribonucleotide sequences complementary to the regulatory region of 
the gene (i.e., the promoter and/or enhancers) to form triple helical structures that 
prevent transcription of the gene in target cells in the body. (See generally, Helene, 
C. 1991, Anticancer Drug Des., 6(6):569-84; Helene, C, et al., 1992, Ann. N.Y. 
Acad. Sci., 660:27-36; and Maher, L.J., 1992, Bioassays 14(12):807-15). 
15 Nucleic acid molecules to be used in triple helix formation for the inhibition 

of transcription are preferably single stranded and composed of 
deoxyribonucleotides. The base composition of these oligonucleotides should 
promote triple helix formation via Hoogsteen base pairing rules, which generally 
require sizable stretches of either purines or pyrimidines to be present on one strand 
20 of a duplex. Nucleotide sequences may be pyrimidine-based, which will result in 
TAT and CGC triplets across the three associated strands of the resulting triple 
helix. The pyrimidine-rich molecules provide base complementarity to a purine-rich 
region of a single strand of the duplex in a parallel orientation to that strand. In 
addition, nucleic acid molecules may be chosen that are purine- rich, for example, 
25 containing a stretch of G residues. These molecules will form a triple helix with a 
DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting in CGC triplets across the 
three strands in the triplex. 

Alternatively, POSH or POSH-AP sequences that can be targeted for triple 
30 helix formation may be increased by creating a so called "switchback" nucleic acid 
molecule. Switchback molecules are synthesized in an alternating 5'-3', 3'-5' 
manner, such that they base pair with first one strand of a duplex and then the other, 
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eliminating the necessity for a sizable stretch of either purines or pyrimidines to be 
present on one strand of a duplex. 

A further aspect of the application relates to the use of DNA enzymes to 
inhibit expression of a POSH or POSH-AP gene. DNA enzymes incorporate some 
5 of the mechanistic features of both antisense and ribozyme technologies. DNA 
enzymes are designed so that they recognize a particular target nucleic acid 
sequence, much like an antisense oligonucleotide, however much like a ribozyme 
they are catalytic and specifically cleave the target nucleic acid. 

There are currently two basic types of DNA enzymes, and both of these were 
10 identified by Santoro and Joyce (see, for example, US Patent No. 6110462). The 
10-23 DNA enzyme comprises a loop structure which connect two arms. The two 
arms provide specificity by recognizing the particular target nucleic acid sequence 
while the loop structure provides catalytic function under physiological conditions. 

Briefly, to design an ideal DNA enzyme that specifically recognizes and 
15 cleaves a target nucleic acid, one of skill in the art must first identify the unique 
target sequence. This can be done using the same approach as outlined for antisense 
oligonucleotides. Preferably, the unique or substantially sequence is a G/G rich of 
approximately 18 to 22 nucleotides. High G/C content helps insure a stronger 
interaction between the DNA enzyme and the target sequence. 
20 When synthesizing the DNA enzyme, the specific antisense recognition 

sequence that will target the enzyme to the message is divided so that it comprises 
the two arms of the DNA enzyme, and the DNA enzyme loop is placed between the 
two specific arms. 

Methods of making and administering DNA enzymes can be found, for 
25 example, in US 61 10462. Similarly, methods of delivery DNA ribozymes in vitro or 
in vivo include methods of delivery RNA ribozyme, as outlined in detail above. 
Additionally, one of skill in the art will recognize that, like antisense 
oligonucleotide, DNA enzymes can be optionally modified to improve stability and 
improve resistance to degradation. 
30 Antisense RNA and DNA, ribozyme, RNAi and triple helix molecules of the 

application may be prepared by any method known in the art for the synthesis of 
DNA and RNA molecules. These include techniques for chemically synthesizing 

9372369_1 

-77- 



I 



oligodeoxyribonucleotides and oligoribonucleotides well known in the art such as 
for example solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase 
promoterssuch as the T 7 or SP6 polymerase promoters. Alternatively, antisense 
cDNA constructs that synthesize antisense RNA constitutively or inducibly, 
depending on the promoter used, can be introduced stably into cell lines. Moreover, 
various well-known modifications to nucleic acid molecules may be introduced as a 
means of increasing intracellular stability and half-life. Possible modifications 
include but are not limited to the addition of flanking sequences of ribonucleotides 
or deoxyribonucleotides to the 5' and/or 3' ends of the molecule or the use of 
phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the 
oligodeoxyribonucleotide backbone. 

6. Drue Screening Assays 

In certain aspects, the present application provides assays for identifying 
therapeutic agents which either interfere with or promote POSH or POSH-AP 
function. In certain aspects, the present application also provides assays for 
identifying therapeutic agents which either interfere w ith or promote the complex 
formation between a POSH polypeptide and a POSH-AP polypeptide. 

In certain embodiments, agents of the application are antiviral agents, 
optionally interfering with viral maturation, and preferably where the virus is an 
envelope virus, and optionally a retroid virus or an RNA virus. In other 
embodiments, agents of the application are anticancer agents. In further 
embodiments, agents of the application inhibit the progression of a 
neurodegenerative disorder. In certain embodiments, an antiviral or anticancer agent 
or an agent that inhibits the progression of a neurodegenerative disorder interferes 
with the ubiquitin ligase catalytic activity of POSH (e.g., POSH auto-ubiquitination 
or transfer to a target protein). In other embodiments, agents disclosed herein inhibit 
or promote POSH and POSH-AP mediated cellular processes such as apoptosis and 
protein processing in the secretory pathway. 
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In certain preferred embodiments, an antiviral agent interferes with the 
interaction b etween P OSH a nd a P OSH-AP p olypeptide, for e xample ana nti viral 
agent may disrupt or render irreversible interaction between a POSH polypeptide 
and POSH-AP polypeptide (as in the case of a POSH dimer, a heterodimer of two 
5 different POSH polypeptides, homomultimers and heteromultimers). In further 
embodiments, agents of the application are anti-apoptotic agents, optionally 
interfering with INK and/or NF-kB signaling. In yet additional embodiments, 
agents of the application interfere with the signaling of a GTPase, such as Rac or 
Ras, optionally disrupting the interaction between a POSH polypeptide and a Rac 
10 protein. In certain embodiments, agents of the application modulate the ubiquitin 
ligase activity of POSH and may be used to treat certain diseases related to ubiquitin 
ligase activity. In certain embodiments, agents of the application interfere with the 
trafficking of a protein through the secretory pathway. 

In certain embodiments, the application provides assays to identify, optimize 
15 or otherwise assess agents that increase or decrease a ubiquitin-related activity of a 
POSH polypeptide. Ubiquitin-related activities of POSH polypeptides may include 
the self-ubiquitination activity of a POSH polypeptide, generally involving the 
transfer of ubiquitin from an E2 enzyme to the POSH polypeptide, and the 
ubiquitination of a target protein, generally involving the transfer of a ubiquitin from 
20 a POSH polypeptide to the target protein. In certain embodiments, a POSH activity 
is mediated, at least in part, by a POSH RING domain. 

In certain embodiments, an assay comprises forming a mixture comprising a 
POSH polypeptide, an E2 polypeptide and a source of ubiquitin (which may be the 
E2 polypeptide pre-complexed with ubiquitin). Optionally the mixture comprises an 
25 El polypeptide and optionally the mixture comprises a target polypeptide. 
Additional components of the mixture may be selected to provide conditions 
consistent with the ubiquitination of the POSH polypeptide. One or more of a 
variety of parameters may be detected, such as POSH-ubiquitin conjugates, E2- 
ubiquitin thioesters, free ubiquitin and target polypeptide-ubiquitin complexes. The 
30 term "detect" is used herein to include a determination of the presence or absence of 
the subject of detection (e.g., POSH-ubiqutin, E2-ubiquitin, etc.), a quantitative 
measure of the amount of the subject of detection, or a mathematical calculation of 
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the presence, absence or amount of the subject of detection, based on the detection 
of other parameters. The term "detect" includes the situation wherein the subject of 
detection is determined to be absent or below the level of sensitivity. Detection may • 
comprise detection of a label (e.g., fluorescent label, radioisotope label, and other 
5 described below), resolution and identification by size (e.g., SDS-PAGE, mass 
spectroscopy), purification and detection, and other methods mat, in view of this 
specification, will be available to one of skill in the art. For instance, radioisotope 
labeling may be measured by scintillation counting, or by densitometry after 
exposure to a photographic emulsion, or by using a device such as a 
10 Phosphorimager. Likewise, densitometry may be used to measure bound ubiquitin 
following a reaction with an enzyme label substrate that produces an opaque product 
when an enzyme label is used. In a preferred embodiment, an assay comprises 
detecting the POSH-ubiquitin conjugate. 

In certain embodiments, an assay comprises forming a mixture comprising a 
1 5 POSH polypeptide, a target polypeptide and a source of ubiquitin (which may be the 
POSH polypeptide pre-complexed with ubiquitin). Optionally the mixture 
comprises an El and/or E2 polypeptide and optionally the mixture comprises an E2- 
ubiquitin thioester. Additional components of the mixture may be selected to 
provide conditions consistent with the ubiquitination of the target polypeptide. One 
20 or more of a variety of parameters may be detected, such as POSH-ubiquitin 
conjugates and target polypeptide-ubiquitin conjugates. In a preferred embodiment, 
an assay comprises detecting the target polypeptide-ubiquitin conjugate. In another 
preferred embodiment, an assay comprises detecting the POSH-ubiquitin conjugate. 
An assay described above may be used in a screening assay to identify agents 
25 that modulate a ubiquitin-related activity of a POSH polypeptide. A screening assay 
will generally involve adding a test agent to one of the above assays, or any other 
assay designed to assess a ubiquitin-related activity of a POSH polypeptide. The 
parameter(s) detected in a screening assay may be compared to a suitable reference. 
A suitable reference may be an assay run previously, in parallel or later that omits 
the test agent. A suitable reference may also be an average of previous 
measurements in the absence of the test agent. In general the components of a 
screening assay mixture may be added in any order consistent with the overall 
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activity to be assessed, but certain variations may be preferred. For example, in 
certain embodiments, it may be desirable to pre-incubate the test agent and the E3 
(e.g., the POSH polypeptide), followed by removing the test agent and addition of 
other components to complete the assay. In this manner, the effects of the agent 
5 solely on the POSH polypeptide may be assessed. In certain preferred 
embodiments, a screening assay for an antiviral agent employs a target polypeptide 
comprising an L domain, and preferably an HIV L domain. 

In certain embodiments, an assay is performed in a high-throughput format. 
For example, one of the components of a mixture may be affixed to a solid substrate 
10 and one or more of the other components is labeled. For example, the POSH 
polypeptide may be affixed to a surface, such as a 96-well plate, and the ubiquitin is 
in solution and labeled. An E2 and El are also in solution, and the POSH-ubiquitin 
conjugate formation may be measured by washing the solid surface to remove 
uncomplexed labeled ubiquitin and detecting the ubiquitin that remains bound. 
1 5 Other variations may be used. For example, the amount of ubiquitin in solution may 
be detected. In certain embodiments, the formation of ubiquitin complexes may be 
measured by an interactive technique, such as FRET, wherein a ubiquitin is labeled 
with a first label and the desired complex partner (e.g., POSH polypeptide or target 
polypeptide) is labeled with a second label, wherein the first and second label 
20 interact when they come into close proximity to produce an altered signal. In 
FRET, the first and second labels are fluorophores. FRET is described in greater 
detail below. The formation of polyubiquitin complexes may be performed by 
mixing two or more pools of differentially labeled ubiquitin that interact upon 
formation of a polyubiqutin (see, e.g., US Patent Publication 20020042083). High- 
25 throughput may be achieved by performing an interactive assay, such as FRET, in 
solution as well. In addition, if a polypeptide in the mixture, suchas thePOSH 
polypeptide or target polypeptide, is readily purifiable (e.g., with a specific antibody 
or via a tag such as biotin, FLAG, polyhistidine, etc.), the reaction may be 
performed in solution and the tagged polypeptide rapidly isolated, along with any 
30 polypeptides, such as ubiquitin, that are associated with the tagged polypeptide. 
Proteins may also be resolved by SDS-PAGE for detection. 
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In certain embodiments, the ubiquitin is labeled, either directly or indirectly. 
This typically allows for easy and rapid detection and measurement of ligated 
ubiquitin, making the assay useful for high-throughput screening applications. As 
descrived above, certain embodiments may employ one or more tagged or labeled 
5 proteins. A "tag" is meant to include moieties that facilitate rapid isolation of the 
tagged polypeptide. A tag may be used to facilitate attachment of a polypeptide to a 
surface. A "label" is meant to include moieties that facilitate rapid detection of the 
labeled polypeptide. Certain moieties may be used both as a label and a tag (e.g., 
epitope tags that are readily purified and detected with a well-characterized 
1 0 antibody). Biotinylation of polypeptides is well known, for example, a large number 
of biotinylation agents are known, including amine-reactive and thiol-reactive 
agents, for the biotinylation of proteins, nucleic acids, carbohydrates, carboxylic 
acids; see chapter 4, Molecular Probes Catalog, Haugland, 6th Ed. 1996, hereby 
incorporated by reference. A biotinylated substrate can be attached to a biotinylated 
15 component via avidin or streptavidin. Similarly, a large number of haptenylation 
reagents are also known. 

An "El" is a ubiquitin activating enzyme. In a preferred embodiment, El is 
capable of transferring ubiquitin to an E2. In a preferred embodiment, El forms a 
high energy thiolester bond with ubiquitin, thereby "activating" the ubiquitin. An 
20 "E2" is a ubiquitin carrier enzyme (also known as a ubiquitin conjugating enzyme). 
In a preferred embodiment, ubiquitin is transferred from El to E2. In a preferred 
embodiment, the transfer results in a thiolester bond formed between E2 and 
ubiquitin. In a p referred e mbodiment, E 2 i s c apable o f t ransferring u biquitin t o a 
POSH polypeptide. 

25 In an alternative embodiment, a POSH polypeptide, E2 or target polypeptide 

is bound to a bead, optionally with the assistance of a tag. Following ligation, the. 
beads may be separated from the unbound ubiquitin and the bound ubiquitin 
measured. In a preferred embodiment, POSH polypeptide is bound to beads and the 
composition used includes labeled ubiquitin. In this embodiment, the beads with 

30 bound ubiquitin may be separated using a fluorescence-activated cell sorting 
(FACS) machine; Methods for such use are described in U.S. patent application Ser. 
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No. 09/047,119, which is hereby incorporated in its entirety. The amount of bound 
ubiquitin can then be measured. 

In a screening assay, the effect of a test agent may be assessed by, for 
example, assessing the effect of the test agent on kinetics, steady-state and/or 

5 endpoint of the reaction. 

The components of the various assay mixtures provided herein may be 
combined in varying amounts. In a preferred embodiment, ubiquitin (or E2 
complexed ubiquitin) is combined at a final concentration of from 5 to 200 ng per 
100 microliter reaction solution. Optionally El is used at a final concentration of 
10 from 1 to 50 ng per 100 microliter reaction solution. Optionally E2 is combined at a 
final concentration of 10 to 100 ng per 100 microliter reaction solution, more 
preferably 10-50 ng per 100 microliter reaction solution. In a preferred embodiment, 
POSH polypeptide is combined at a final concentration of from 1 to 500 ng per 100 
microliter reaction solution. 
15 Generally, an assay mixture is prepared so as to favor ubiquitin ligase 

activity and/or ubiquitination activity. Generally, this will be physiological 
conditions, such as 50 - 200 mM salt (e.g., NaCl, KC1), pH of between 5 and 9, and 
preferably between 6 and 8. Such conditions may be optimized through trial and 
error. Incubations m ay be performed at any temperature which f acilitates o primal 
20., activity, typically between 4 and 40 °C. Incubation periods are selected for optimum 
activity, but may also be optimized to facilitate rapid high through put screening. 
Typically between 0.5 and 1.5 hours will be sufficient. A variety of other reagents 
may be included in the compositions. These include reagents like salts, solvents, 
buffers, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
25 facilitate optimal ubiquitination enzyme activity and/or reduce non-specific or 
background interactions. Also reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., 
may be used. The compositions will also preferably include adenosine tri-phosphate 
(ATP). The mixture of components may be added in any order that promotes 
30 ubiquitin ligase activity or optimizes identification of candidate modulator effects. In 
a preferred embodiment, ubiquitin is provided in a reaction buffer solution, followed 
by addition of the ubiquitination enzymes. In an alternate preferred embodiment, 
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ubiquitin is provided in a reaction buffer solution, a candidate modulator is then 
added, followed by addition of the ubiquitination enzymes. 
^ - 4n general, a test agent that decreases a POSH ubiquitin-related activity may 

be used to inhibit POSH function in vivo, while a test agent that increases a POSH 
5 ubiquitin-related activity may be used to stimulate POSH function in vivo. Test 
agent may be modified for use in vivo, e.g., by addition of a hydrophobic moiety, 
such as an ester. 

In certain embodiments, a ubiquitination assay as described above for POSH 
can similarly be conducted for a Cbl-b, a SIAH1, or a TTC3 polypeptide. In certain 

10 embodiments, the application provides assays to identify, optimize or otherwise 
assess agents that increase or decrease a ubiquitin-related activity of a Cbl-b, a 
SIAH1, or a TTC3 polypeptide. Ubiquitin-related activities of Cbl-b, SIAH1, or 
TTC3 polypeptides may include the self-ubiquitination activity of a Cbl-b, SIAH1, 
or TTC3 polypeptide, generally involving the transfer of ubiquitin from an E2 

1 5 enzyme to the respective Cbl-b, SIAH1 , or TTC3 polypeptide, and the ubiquitination 
of a target protein, e.g., the p85 subunit of PI3K, e.g, synaptophysin, generally 
involving the transfer of a ubiquitin from a Cbl-b, SIAH1, or TTC3 polypeptide to 
the target protein, e.g, the p85 subunit of PI3K, e.g., synaptophysin, e.g., 
HERPUD1. In certain embodiments, a Cbl-b, a SIAH1, or a TTC3 activity is 

20 mediated, at least in part, by a RING domain of a Cbl-b, a SIAH1, or a TTC3, 
respectively. 

An additional POSH-AP may be added to a POSH ubiquitination assay to 
assess the effect of the POSH-AP (e.g., PRKAR1A, PRKACA, or PRKACB) on 
POSH-mediated ubiquitination and/or to assess whether the POSH-AP is a target for 

25 POSH-mediated ubiquitination (e.g., HERPUD1 , e.g., PKA). 

Certain embodiments of the application relate to assays for identifying agents 
that b ind t o a P OSH o r P OSH-AP p olypeptide, o ptionally a p articular domain o f 
POSH such as an SH3 or RING domain or a particular domain of a POSH-AP, 
particularly a kinase catalytic domain or ATP binding domain. In preferred 

30 embodiments, a POSH polypeptide is a polypeptide comprising the fourth SH3 

domain of hPOSH (SEQ ID NO: 30). A wide variety of assays maybe used for this 

purpose, including labeled in vitro protein-protein binding assays, electrophoretic 
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mobility shift assays, immunoassays for protein binding, and the like. The purified 
protein may also be used for determination of three-dimensional crystal structure, 
which can be used for modeling intermolecular interactions and design of test 
agents. In one embodiment, an assay detects agents which inhibit interaction of one 
5 or more subject POSH polypeptides with a POSH-AP. In another embodiment, the 
assay detects agents which modulate the intrinsic biological activity of a POSH 
polypeptide or POSH complex, such as an enzymatic activity, binding to other 
cellular components, cellular compartmentalization, and the like. 

Certain embodiments of the application relate to assays for identifying agents 
10 that modulate a POSH-AP polypeptide such as a PKA subunit polypeptide. 
Preferred PKA subunit polypeptides include PRKAR1A, PRKACA, and PRKACB. 
Exemplary assays used for this purpose may include detecting phosphorylation of 
PKA subunit, kinase activity of the PKA subunit, ability of the PKA subunit to elicit 
downstream signaling of the PKA pathway, and the like. For example, activity of 
15 protein kinase A can be assayed either in vitro or in vivo. PKA activity can be 
determined b y d etecting p osphorylation ofaPKAs pecific s ubstrate. T he s pecific 
PKA substrate can be any convenient peptide with a serine that is recognized as a 
phosphorylation site by PKA. For example, the peptide substrate can have the 
sequence: Leu-Arg-Arg-Ala- Ser-Leu-Gly. 
20 In one aspect, the application provides methods and compositions for the 

identification of compositions that interfere with the function of POSH or POSH-AP 
polypeptides. Given the role of POSH polypeptides in viral production, 
compositions that perturb the formation or stability of the protein-protein 
interactions between POSH polypeptides and the proteins that they interact with, 
25 such as POSH-APs, and particularly POSH complexes comprising a viral protein, 
are candidate pharmaceuticals for the treatment of viral infections. 

While not wishing to be bound to mechanism, it is postulated that POSH 
polypeptides promote the assembly of protein complexes that are important in 
release of virions and other biological processes. Complexes of the application may 
30 include a combination of a POSH polypeptide and a POSH-AP. Exemplary 
complexes may comprise one or more of the following: a POSH polypeptide (as in 
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the case of a POSH dimer, a heterodimer of two different POSH, homomultimers 
and heteromultimers); a HERPUD1 polypeptide; or an MSTP028 polypeptide. 

In an assay for an antiviral or antiapoptotic agent, the test agent is assessed 
for its ability to disrupt or inhibit the formation of a complex of a POSH polypeptide 
and a small GTPase, such as a Rac polypeptide, particularly a human Rac 
polypeptide, such as Racl . 

A variety of assay formats will suffice and, in light of the present disclosure, 
those not expressly described herein will nevertheless be comprehended by one of 
ordinary skill in the art. Assay formats which approximate such conditions as 
formation of protein complexes, enzymatic activity, and even a POSH polypeptide- 
mediated membrane reorganization or vesicle formation activity, may be generated 
in many different forms, and include assays based on cell-free systems, e.g., purified 
proteins or cell lysates, as well as cell-based assays which utilize intact cells. 
Simple binding assays can also be used to detect agents which bind to POSH. Such 
binding assays may also identify agents that act by disrupting the interaction 
between a POSH polypeptide and a POSH interacting protein, or the binding of a 
POSH polypeptide or complex to a substrate. Agents to be tested can be produced, 
for example, by bacteria, yeast or other organisms (e.g., natural products), produced 
chemically (e.g., small molecules, including peptidomimetics), or produced 
recombinantly. In a preferred embodiment, the test agent is a small organic 
molecule, e.g., other than a peptide or oligonucleotide, having a molecular weight of 
less than about 2,000 daltons. 

In many drag screening programs which test libraries of compounds and 
natural extracts, high throughput assays are desirable in order to maximize the 
number of compounds surveyed in a given period of time. Assays of the present 
application which are performed in cell-free systems, such as may be developed with 
purified or semi-purified proteins or with lysates, are often preferred as "primary" 
screens in that they can be generated to permit rapid development and relatively easy 
detection of an alteration in a molecular target which is mediated by a test 
compound. Moreover, the effects of cellular toxicity and/or bioavailability of the 
test compound can be generally ignored in the in vitro system, the assay instead 
being focused primarily on the effect of the drug on the molecular target as may be 
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manifest in an alteration of binding affinity with other proteins or changes in v 
enzymatic properties of the molecular target. 
, - _Jn preferred in vitro embodiments of the present assay, a reconstituted POSH 
complex c omprises a reconstituted mixture o f at 1 east s emi-purified p roteins. B y 
5 semi-purified, it is meant that the proteins utilized in the reconstituted mixture have 
been previously separated from other cellular or viral proteins. For instance, in 
contrast to cell lysates, the proteins involved in POSH complex formation are 
present in the mixture to at least 50% purity relative to all other proteins in the 
mixture, and more preferably are present at 90-95% purity. In certain embodiments 
10 of the subject method, the reconstituted protein mixture is derived by mixing highly 
purified proteins such that the reconstituted mixture substantially lacks other 
proteins (such as of cellular or viral origin) which might interfere with or otherwise 
alter the ability to measure POSH complex assembly and/or disassembly. 

Assaying POSH complexes, in the presence and absence of a candidate 
15 inhibitor, can be accomplished in any vessel suitable for containing the reactants. 
Examples include microtitre plates, test tubes, and micro-centrifuge tubes. 

In one embodiment of the present application, drug screening assays can be 
generated which detect inhibitory agents on the basis of their ability to interfere with 
assembly or s lability o f the P OSH c omplex. In a n exemplary binding assay, t he 
20 compound of interest is contacted with a mixture comprising a POSH polypeptide 
and at least one interacting polypeptide. Detection and quantification of POSH 
complexes provides a means for determining the compound's efficacy at inhibiting 
(or potentiating) interaction between the two polypeptides. The efficacy of the 
compound can be assessed by generating dose response curves from data obtained 
25 using various concentrations of the test compound. Moreover, a control assay can 
also be performed to provide a baseline for comparison. In the control assay, the 
formation of complexes is quantitated in the absence of the test compound. 

Complex formation between the POSH polypeptides and a substrate 
polypeptide may be detected by a variety of techniques, many of which are 
30 effectively described above. For instance, modulation in the formation of complexes 
can be quantitated using, for example, detectably labeled proteins (e.g., radiolabeled, 
fluorescently labeled, or enzymatically labeled), by immunoassay, or by 
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chromatographic detection. Surface plasmon resonance systems, such as those 
available from B iacore International AB (Uppsala, Sweden), may also be used to 
detect protein-protein interaction 

Often, it will be desirable to immobilize one of the polypeptides to facilitate 
5 separation of complexes from uncomplexed forms of one of the proteins, as well as 
to accommodate automation of the assay. In an illustrative embodiment, a fusion 
protein can be provided which adds a domain that permits the protein to be bound to 
an insoluble matrix. For example, GST-POSH fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
10 derivatized microtitre plates, which are then combined with a potential interacting 
protein, e.g., an 35 S-labeled polypeptide, and the test compound and incubated under 
conditions conducive to complex formation. F ollowing i ncubation, the b eads are 
washed to remove any unbound interacting protein, and the matrix bead-bound 
radiolabel determined directly (e.g., beads placed in scintillant), or in the supernatant 
15 after the complexes are dissociated, e.g., when microtitre plate is used. 
Alternatively, after washing away unbound protein, the complexes can be 
dissociated from the matrix, separated by SDS-PAGE gel, and the level of 
interacting polypeptide found in the matrix-bound fraction quantitated from the gel 
using standard electrophoretic techniques. 
20 In a further embodiment, agents that bind to a POSH or POSH-AP may be 

identified by using an immobilized POSH or POSH-AP. In an illustrative 
embodiment, a fusion protein can be provided which adds a domain that permits the 
protein to be bound to an insoluble matrix. For example, GST-POSH fusion 
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. 
25 Louis, MO) or glutathione derivatized microtitre plates, which are then combined 
with a potential labeled binding agent and incubated under conditions conducive to 
binding. Following incubation, the beads are washed to remove any unbound agent, 
and the matrix bead-bound label determined directly, or in the supernatant after the 
bound agent is dissociated. 
30 In yet another embodiment, the POSH polypeptide and potential interacting 

polypeptide can be used to generate an interaction trap assay (see also, U.S. Patent 
NO: 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol 
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Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; and 
Iwabuchi et al. ( 1993) Oncogene 8 : 1693- 1696), for subsequently detecting agents 
which disrupt binding of the proteins to one and other. 

In particular, the method makes use of chimeric genes which express hybrid 

5 proteins. To illustrate, a first hybrid gene comprises the coding sequence for a 
DNA-binding domain of a transcriptional activator can be fused in frame to the 
coding sequence for a "bait" protein, e.g., a POSH polypeptide of sufficient length to 
bind to a potential interacting protein. The second hybrid protein encodes a 
transcriptional activation domain fused in frame to a gene encoding a "fish" protein, 

10 e.g., a p otential i nteracting p rotein o f s ufficient length t o i nteract with t he P OSH 
polypeptide portion of the bait fusion protein. If the bait and fish proteins are able to 
interact, e.g., form a POSH complex, they bring into close proximity the two 
domains o f t he t ranscriptional a ctivator. T his p roximity c auses t ranscription o f a 
reporter gene which is operably linked to a transcriptional regulatory site responsive 

15 to the transcriptional activator, and expression of the reporter gene can be detected 
and used to score for the interaction of the bait and fish proteins. 

One aspect of the present application provides reconstituted protein 
preparations including a POSH polypeptide and one or more interacting 
polypeptides. 

20 In still further embodiments of the present assay, the POSH complex is 

generated in whole cells, taking advantage of cell culture techniques to support the 
subject assay. For example, as described below, the POSH complex can be 
constituted in a eukaryotic cell culture system, including mammalian and yeast cells. 
Often it will be desirable to express one or more viral proteins (e.g., Gag or Env) in 

25 such a cell along with a subject POSH polypeptide. It may also be desirable to 
infect the cell with a virus of interest. Advantages to generating the subject assay in 
an intact cell include the ability to detect inhibitors which are functional in an 
environment more closely approximating that which therapeutic use of the inhibitor 
would require, including the ability of the agent to gain entry into the cell. 

30 Furthermore, certain of the in vivo embodiments of the assay, such as examples 
given below, are amenable to high through-put analysis of candidate agents. 
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The components of the POSH complex can be endogenous to the cell 
selected to support the assay. Alternatively, some or all of the components can be 
derived from exogenous sources. For instance, fusion proteins can be introduced 
into the cell by recombinant techniques (such as through the use of an expression 
vector), as well as by microinjecting the fusion protein itself or mRNA encoding the 
fusion protein. 

In many embodiments, a cell is manipulated after incubation with a 
candidate agent and assayed for a POSH or POSH-AP activity. In certain 
embodiments, a POSH-AP, such as PTPN12, is a tyrosine phosphatase. Tyrosine 
phosphatase activity may be assessed by incubating a cell lysate, which has 
optionally been treated with pervanadate to stimulate tyrosine phosphorylation, with 
a POSH-AP that has tyrosine phosphatase activity, immunoprecipitating the 
substrate protein and immunoblotting for the presence of phosphorylated tyrosine. 
Alternatively, tyrosine phosphatase activity may be assessed by the substrate 
trapping method. The substrate trapping method employs catalytically inactive 
mutants of a tyrosine phosphatase (e.g., a POSH-AP such as PTPN12). The 
catalytically inactive phosphatase mutant is immobilized on a solid matrix (e.g., 
AG25-protein A-Sepharose beads) and incubated with a substrate protein. The solid 
matrix to which the catalytically inactive phosphatase is bound is isolated and 
subjected to SDS-PAGE and immunoblotting for the presence of the substrate 
protein. The proteins employed in a phosphatase assay may optionally be purified 
proteins. (Lyons, PD et al (2001) J Biol Chem 246:24422-31 ; Garton, AJ et al 
(1996) Mol Cell Biol 16:6408-18). 

In many embodiments, a cell is manipulated after incubation with a candidate agent 
and assayed for a POSH or POSH-AP activity. In certain embodiments a POSH or 
POSH-AP activity is represented by production of virus like particles. As 
demonstrated herein, an agent that disrupts POSH or POSH-AP activity can cause a 
decrease in the production of virus like particles. Other bioassays for POSH or 
POSH-AP activities may include apoptosis assays (e.g., cell survival assays, 
apoptosis reporter gene assays, etc.) and NF-kB nuclear localization assays (see e.g., 
Tapon et al. (1 998) EMBO J. 17: 1395-1404). One apoptosis assay that may be used 
to assess TGN-associated protein activity is the TUNEL assay, which is used to 
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detect the presence of apoptotic cell death. In the TUNEL assay, the enzyme 
terminal deoxynucleotidyl transferase labels 3'-OH DNA ends (which are generated 
during apoptosis) with biotinylated nucleotides. The biotinylated nucleotides are- 
then detected by immunoperoxidase staining. Another apoptosis assay that may be 

5 used to assess TGN-associated protein activity is the caspase assay, in which caspase 
activity is measured using a blue fluorescent substrate. Cleavage of the substrate by 
caspase 3 releases the fluorochrome, which then fluoresces green. An assay that 
may be employed to monitor cell proliferation associated with a TGN-associated 
protein is the MTT cell proliferation assay. The MTT cell proliferation assay is a 

1 0 colorimetric assay which measures the reduction of a tetrazolium component (MTT) 
into an insoluble formazan product by the mitochondria of viable cells. After 
incubation of the cells with the MTT reagent, a detergent solution is added to lyse 
the cells and solubilize the colored crystals. The samples may be read using an 
ELISA plate reader. The amount of color produced is directly proportional to the 

1 5 number of viable cells. 

In certain embodiments, POSH or POSH-AP activities may include, without 
limitation, complex formation, ubiquitination and membrane fusion events (eg. 
release of viral buds or fusion of vesicles). POSH-AP activity may be assessed by 
the presence of phosphorylated substrate, such as, in the case of PKA, 
20 phosphorylated POSH. The interaction of POSH with a small GTPase such as Rac 
may also be indicative of the absence of phosphorylation of POSH by PKA. POSH 
complex formation may be assessed by immunoprecipitation and analysis of co- 
immunoprecipiated proteins or affinity purification and analysis of co-purified 
proteins. Fluorescence Resonance Energy Transfer (FRET)-based assays or other 
25 energy transfer assays may also be used to determine complex formation. 

The effect of an agent that modulates the activity of POSH or a POSH-AP 
may b e evaluated for e ffects on the trafficking o f a p rotein through the s ecretory 
system. For example, the effects of the agent on the trafficking of the protein may 
be assessed by detecting the glycosylation of the protein in the presence and absence 
30 of the agent, for instance, through the use of antibodies specific for sugar moieties. 
For example, cell lysates from cells treated in the absence and presence of an agent 
that modulates the activity of POSH or a POSH-AP may be subjected to 
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immunoprecipitation and immunoblotting with antibodies directed to the 
glycoprotein of interest and the glycosylation state of the protein then compared. 

Additional bioassays for assessing POSH and POSH-AP activities may 
include assays to detect the improper processing of a protein that is associated with a 

5 neurological disorder. One assay that may be used is an assay to detect the presence, 
including an i ncrease o r a d ecrease i n t he amount, o f a p rotein associated w ith a 
neurological disorder. For example, the use of RNAi may be employed to 
knockdown the expression of a POSH or POSH-AP in cells (e.g., CHO cells or COS 
cells). The production of a secreted protein such as for. example, amyloid beta, in 
10 the cell culture media, can then be assessed and compared to production of the 
secreted protein from control cells, which may be cells in which the POSH or 
POSH-AP activity has not been inhibited. The production of secreted proteins may 
be assessed, such as amyloid beta protein, which is associated with Alzheimer's 
disease. In some instances, a label may be incorporated into a secreted protein and 

15 the presence of the labeled secreted protein detected in the cell culture media. 
Proteins secreted from any cell type may be assessed, including for example, neural 
cells. 

The effect of an agent that modulates the activity of POSH or a POSH-AP 
may be evaluated for effects on mouse models of various neurological disorders. 

20 For example, mouse models of Alzheimer's disease have been described. See, for 
example, United States Patent No. 5,612,486 for 'Transgenic Animals Harboring 
APP Allele Having Swedish Mutation,"Patent No. 5,850,003 (the* 003 patent) for 
'Transgenic Rodents Harboring APP Allele Having Swedish Mutation,"and United 
States Patent No. 5,455,169 entitled "Nucleic Acids for Diagnosing and Modeling 

25 Alzheimer's Disease". Mouse models of Alzheimer's disease tend to produce 
elevated levels of beta-amyloid protein in the brain, and the increase or decrease of 
such protein in response to treatment with a test agent may be detected. In some 
instances, it may also be desirable to assess the effects of a test agent on cognitive or 
behavioral characteristics of a mouse model for Alzheimer's disease, as well as 

30 mouse models for other neurological disorders. 

In a further embodiment, transcript levels may be measured in cells having 
higher or lower levels of POSH or POSH-AP activity in order to identify genes that 
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are regulated by POSH or POSH-APs. Promoter regions for such genes (or larger 
portions of such genes) may be operatively linked to a reporter gene and used in a 
- reporter gene-based assay to detect agents that enhance or diminish POSH- or 
POSH-AP-regulated gene expression. Transcript levels may be determined in any 
5 way known in the art, such as, for example, Northern blotting, RT-PCR, microarray, 
etc. Increased POSH activity may be achieved, for example, by introducing a strong 
POSH expression vector. Decreased POSH activity may be achieved, for example, 
by RNAi, antisense, ribozyme, gene knockout, etc. 

In general, where the screening assay is a binding assay (whether protein- 
ic) protein binding, agent-protein binding, etc.), one or more of the molecules may be 
joined to a label, where the label can directly or indirectly provide a detectable 
signal. Various labels include radioisotopes, fluorescers, chemiluminescers, 
enzymes, specific binding molecules, particles, e.g., magnetic particles, and the like. 
Specific b hiding m olecules i nclude p airs, s uch as b iotin and s treptavidin, d igoxin 
15 and antidigoxin etc. For the specific binding members, the complementary member 
would normally be labeled with a molecule that provides for detection, in 
accordance with known procedures. 

In further embodiments, the application provides methods for identifying 
targets for therapeutic intervention. A polypeptide that interacts with POSH or 
20 participates in a POSH-mediated process (such as viral maturation) may be used to 
identify candidate therapeutics. Such targets may be identified by identifying 
proteins that associated with POSH (POSH-APs) by, for example, 
immunoprecipitation with an anti-POSH antibody, in silico analysis of high- 
throughput binding data, two-hybrid screens, and other protein-protein interaction 
25 assays described herein or otherwise known in the art in view of this disclosure. 
Agents t hat b ind t o s uch t argets o r d isrupt p rotein-protein i nteractions t hereof, o r 
inhibit a biochemical activity thereof may be used in such an assay. Targets that 
have been identified by such approaches include POSH-APs provided in Tables 7 
and 8 and in Figure 36. 

30 A variety of other reagents may be included in the screening assay. These 

include reagents like salts, neutral proteins, e.g., albumin, detergents, etc that are 
used to facilitate optimal protein-protein binding and/or reduce nonspecific or 
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background interactions. Reagents that improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti- microbial agents, etc. may be used. The 
mixture of components are added in any order that provides for the requisite 
binding. Incubations are performed at any suitable temperature, typically between 4 
°C and 40 °C. Incubation periods are selected for optimum activity, but may also be 
optimized to facilitate rapid high-throughput screening. 

In certain embodiments, a test agent may be assessed for antiviral or 
anticancer activity by assessing effects on an activity (function) of a POSH-AP. 
Activity (function) may be affected by an agent that acts at one or more of the 
transcriptional, translational or post-translational stages. For example, an siRNA 
directed to a POSH-AP encoding gene will decrease activity, as will a small 
molecule that interferes with a catalytic activity of a POSH-AP. In certain 
embodiments, the agent inhibits the activity of one or more polypeptides selected 
from among HERPUDl and MSTP028. 



7. Exemplary Nucleic Acids and Expression Vectors 

In certain aspects, the application relates to nucleic acids encoding POSH 
polypeptides, such as, for example, SEQ ED Nos: 2, 5, 7, 9, 11, 26, 27, 28, 29 and 
30. Nucleic acids of the application are further understood to include nucleic acids 

20 that comprise variants of SEQ ED Nos:l, 3, 4, 6, 8, 10, 31, 32, 33, 34, and 35. 
Variant nucleotide sequences include sequences that differ by one or more 
nucleotide substitutions, additions or deletions, such as allelic variants; and will, 
therefore, include coding sequences that differ from the nucleotide sequence of the 
coding sequence designated in SEQ ID Nos:l, 3, 4, 6, 8 10, 31, 32, 33, 34, and 35, 

25 e.g., due to the degeneracy of the genetic code. In other embodiments, variants will 
also include sequences that will hybridize under highly stringent conditions to a 
nucleotide sequence of a coding sequence designated in any of SEQ ID Nos:l, 3, 4, 
6, 8 10, 31, 32, 33, 34, and 35. Preferred nucleic acids of the application are human 
POSH sequences, including, for example, any of SEQ ID Nos: 1, 3, 4, 6, 31, 32, 33, 

30 34, 35 and variants thereof and nucleic acids encoding an amino acid sequence 
selected from among SEQ ED Nos: 2, 5, 7, 26, 27, 28, 29 and 30. 
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In certain aspects, the application relates to nucleic acids encoding POSH-AP 
polypeptides. For example, POSH-APs of the disclosure are listed in Table 7. 
Nucleic acid sequences encoding these POSH-APs are provided in Figure 36. 
Additional examples of POSH-APs of the disclosure are provided in Table 8. In 
certain embodiments, variants will also include nucleic acid sequences that will 
hybridize under highly stringent c onditions to a nucleotide sequence of a coding 
sequence of a POSH-AP. Preferred nucleic acids of the application are human 
POSH-AP sequences and variants thereof. 

One of ordinary skill in the art will understand readily that appropriate 
stringency conditions which promote DNA hybridization can be varied. For 
example, one could perform the hybridization at 6.0 x sodium chloride/sodium 
citrate (SSC) at about 45 °C, followed by a wash of 2.0 x SSC at 50 °C. For 
example, the salt concentration in the wash step can be selected from a low 
stringency of about 2.0 x SSC at 50 °C to a high stringency of about 0.2 x SSC at 50 
°C. In addition, the temperature in the wash step can be increased from low 
stringency conditions at room temperature, about 22 °C, to high stringency 
conditions at about 65 °C. Both temperature and salt may be varied, or temperature 
or salt concentration may be held constant while the other variable is changed. In 
one embodiment, the application provides nucleic acids which hybridize under low 
stringency conditions of 6 x SSC at room temperature followed by a wash at 2 x 
SSC at room temperature. 

Isolated nucleic acids which differ from the POSH nucleic acid sequences or 
from the POSH-AP nucleic acid sequences due to degeneracy in the genetic code are 
also within the scope of the application. For example, a number of amino acids are 
designated by more than one triplet. Codons that specify the same amino acid, or 
synonyms (for example, CAU and CAC are synonyms for histidine) may result in 
"silent" mutations which do not affect the amino acid sequence of the protein. 
However, it is expected that DNA sequence polymorphisms that do lead to changes 
in the amino acid sequences o f the subject proteins will exist among mammalian 
cells. One skilled in the art will appreciate that these variations in one or more 
nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids encoding a 
particular protein may exist among individuals of a given species due to natural 
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allelic variation. Any and all such nucleotide variations and resulting amino acid 
polymorphisms are within the scope of this application. 
- -Optionally, a POSH or a POSH-AP nucleic acid of the application will 

genetically complement a partial or complete loss of function phenotype in a cell. 
For example, a POSH nucleic acid of the application may be expressed in a cell in 
which endogenous POSH has been reduced by RNAi, and the introduced' POSH 
nucleic acid will mitigate a phenotype resulting from the RNAi. An exemplary 
POSH loss of function phenotype is a decrease in virus-like particle production in a 
cell transfected with a viral vector, optionally an HIV vector. 
> Another aspect of the application relates to POSH and POSH-AP nucleic 

acids that are used for antisense, RNAi or ribozymes. As used herein, nucleic acid 
therapy refers to administration or in situ generation of a nucleic acid or a derivative 
thereof which specifically hybridizes (e.g., binds) under cellular conditions with the 
cellular mRNA and/or genomic DNA encoding one of the POSH or POSH-AP 
5 polypeptides so as to inhibit production of that protein, e.g., by inhibiting 
transcription and/or translation. The binding may be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes,- through 
specific interactions in the major groove of the double helix. 

A nucleic acid therapy construct of the present application can be delivered, 
0 for example, as ah expression plasmid which, when transcribed in the cell, produces 
RN A w hich i s c omplementary t o a 1 1 east a u nique p ortion o f t he c ellular m RNA 
which encodes a POSH or POSH-AP polypeptide. Alternatively, the the construct is 
an oligonucleotide which is generated ex vivo and which, when introduced into the 
cell causes inhibition of expression by hybridizing with the mRNA and/or genomic 
5 sequences encoding a POSH or POSH-AP polypeptide. Such oligonucleotide 
probes are optionally modified oligonucleotide which are resistant to endogenous 
nucleases, e.g., exonucleases and/or endonucleases, and is therefore stable in vivo. 
Exemplary nucleic acid molecules for use as antisense oligonucleotides are 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also 
0 U.S. Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general 
approaches to constructing oligomers useful in nucleic acid therapy have been 
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reviewed, for example, by van der Krol et al„ (1988) Biotechniques 6:958-976; and 
Stein et al., (1988) Cancer Res 48:2659-2668. 

Accordingly, the modified oligomers of the application are useful in ' 
therapeutic, diagnostic, and research contexts. In therapeutic applications, the 

5 oligomers are utilized in a manner appropriate for nucleic acid therapy in general. 

In another aspect of the application, the subject nucleic acid is provided in an 
expression vector comprising a nucleotide sequence encoding a POSH or POSH-AP 
polypeptide and operably linked to at least one regulatory sequence. Regulatory 
sequences are art-recognized and are selected to direct expression of the POSH or 

10 POSH-AP polypeptide. Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements. Exemplary regulatory 
sequences are described in Goeddel; Gene Expression Technology: Methods in 
Enzymology, Academic Press, San Diego, CA (1990). For instance, any of a wide 
variety of expression control sequences that control the expression of a DNA 

1 5 sequence when operati vely linked to it may be used in these vectors to express DNA 
sequences encoding a POSH or POSH-AP polypeptide. Such useful expression 
control sequences, include, for example, the early and late promoters of SV40, tet 
promoter, adenovirus or cytomegalovirus immediate early promoter, the lac system, 
the trp system, the TAC or TRC system, T7 promoter whose expression is directed 

20 by T7 RNA polymerase, the major operator and promoter regions of phage lambda , 
the control regions for fd coat protein, the promoter for 3-phosphoglycerate kinase 
or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the 
promoters of the yeast a-mating factors, the polyhedron promoter of the baculo virus 
system and other sequences known to control the expression of genes of prokaryotic 

25 or eukaryotic cells or their viruses, and various combinations thereof. It should be x 
understood that the design of the expression vector may depend on such factors as 
the choice of the host cell to be transformed and/or the type of protein desired to be 
expressed. Moreover, the vector's copy number, the ability to control that copy 
number and the expression of any other protein encoded by the vector, such as 

30 antibiotic markers, should also be considered. 

As will be apparent, the subject gene constructs can be used to cause 
expression of the POSH or POSH-AP polypeptides in cells propagated in culture, 
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e.g., to produce proteins or polypeptides, including fusion proteins or polypeptides, 
for purification. 

This application also pertains to a host cell transfected with a recombinant 
gene including a coding sequence for one or more of the POSH or POSH-AP 
5 polypeptides. The host cell may be any prokaryotic or eukaryotic cell. For 
example, a polypeptide of the present application may be expressed in bacterial cells 
such as E. coli, insect cells (e.g., using a baculovirus expression system), yeast, or 
mammalian cells. Other suitable host cells are known to those skilled in the art. 
Accordingly, the present application further p ertains to m ethods o f producing the 
10 POSH or POSH-AP polypeptides. For example, a host cell transfected with an 
expression vector encoding a POSH polypeptide can be cultured under appropriate 
conditions to allow expression of the polypeptide to occur. The polypeptide may be 
secreted and isolated from a mixture of cells and medium containing the 
polypeptide. Alternatively, the polypeptide may be retained cytoplasmically and the 
15 cells harvested, lysed and the protein isolated. A cell culture includes host cells, 
media and other byproducts. Suitable media for cell culture are well known in the 
art. The polypeptide can be isolated from cell culture medium, host cells, or both 
using techniques known in the art for purifying proteins, including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
20 immunoaffinity p urification w ith antibodies specific for p articular epitopes o f the 
polypeptide. In a preferred embodiment, the POSH or POSH-AP polypeptide is a 
fusion protein containing a domain which facilitates its purification, such as a 
POSH-GST fusion protein, POSH-intein fusion protein, POSH-cellulose binding 
domain fusion protein, POSH-polyhistidine fusion protein etc. 
25 A recombinant POSH or POSH-AP nucleic acid can be produced by ligating 

the cloned gene, or a portion thereof, into a vector suitable for expression in either 
prokaryotic cells, eukaryotic cells, or both. Expression vehicles for production of a 
recombinant POSH or POSH-AP polypeptides include plasmids and other vectors. 
For instance, suitable vectors for the expression of a POSH polypeptide include 
30 plasmids of the types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX- 
derived plasmids, pBTac-derived plasmids and pUC-derived plasmids for 
expression in prokaryotic cells, such as E. coli. 
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... The preferred mammalian expression vectors contain both prokaryotic 
sequences to facilitate the propagation of the vector in bacteria, and one or more 
^ - eiikaryotic transcription units that are expressed in eukaryotic cells. The 
pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, 
5 pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples of 
mammalian expression vectors suitable for transfection of eukaryotic cells. Some of 
these vectors are modified with sequences from bacterial plasmids, such as pBR322, 
to facilitate replication and drug resistance selection in both prokaryotic and 
eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papilloma 
10 virus (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used 
for transient expression of proteins in eukaryotic cells. Examples of other viral 
(including retroviral) expression systems can be found below in the description of 
gene therapy delivery systems. The various methods employed in the preparation of 
the'plasmids and transformation of host organisms are well known in the art. For 
15 other suitable expression systems for both prokaryotic and eukaryotic cells, as well 
as general recombinant procedures, see Molecular Cloning A Laboratory Manual, 
2nd E d., e d. b y S ambrook, F ritsch and M aniatis ( Cold S pring H arbor Laboratory 
Press, 1989) Chapters 16 and 17. In some instances, it may be desirable to express 
the recombinant POSH or POSH-AP polypeptide by the use of a baculovirus 
20 expression system. Examples of such baculovirus expression systems include pVL- 
derived vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived vectors 
(such as pAcUWl), and pBlueBac-derived vectors (such as the B-gal containing 
pBlueBacIII). 

Alternatively, the coding sequences for the polypeptide can be incorporated 
25 as a part of a fusion gene including a nucleotide sequence encoding a different 
polypeptide. This type of expression system can be useful under conditions where it 
is desirable, e.g., to produce an immunogenic fragmentof a POSH orPOSH-AP 
polypeptide. For example, the VP6 capsid protein of rotavirus can be used as an 
immunologic carrier protein for portions of polypeptide, either in the monomeric 
30 form or in the form of a viral particle. The nucleic acid sequences corresponding to 
the p ortion o f the P OSH o r P OSH-AP p olypeptide t o w hich antibodies are t o b e 
raised can be incorporated into a fusion gene construct which includes coding 
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sequences for a late vaccinia virus structural protein to produce a set of recombinant 
viruses expressing fusion proteins comprising a portion of the protein as part of the 
virion. The Hepatitis B surface antigen can also be utilized in this role as well. 
Similarly, chimeric constructs coding for fusion proteins containing a portion of a 
5 POSH polypeptide and the poliovirus capsid protein can be created to enhance 
immunogenicity (see, for example, EP Publication NO: 0259149; and Evans et al.„ 
(1989) Nature 339:385; Huang et al., (1988) J. Virol. 62:3855; and Schlienger et al., 

(1992)7. Virol 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can be 

10 utilized, wherein a desired portion of a POSH or POSH-AP polypeptide is obtained 
directly from organo-chemical synthesis of the peptide onto an oligomeric branching 
lysine core (see, for example, Posnett et al., (1988) JBC 263:1719 and Nardelli et 
al., (1992) J. Immunol. 148:914). Antigenic determinants of a POSH or POSH-AP 
polypeptide can also be expressed and presented by bacterial cells. 

15 In another embodiment, a fusion gene coding for a purification leader 

sequence, such as a poly-(His)/enterokinase cleavage site sequence at the N- 
terminus of the desired portion of the recombinant protein, can allow purification of 
the expressed fusion protein by affinity chromatography using a Ni 2+ metal resin. 
The purification leader sequence can then be subsequently removed by treatment 

20 with enterokinase to provide the purified POSH or POSH-AP polypeptide (e.g., see 
Hochuli et al., (1987) J. Chromatography 411:177; and Janknecht et al., PNAS USA 
88:8972). 

Techniques for making fusion genes are well known. Essentially, the joining 
of various DNA fragments coding for different polypeptide sequences is performed 

25 in accordance with conventional techniques, employing blunt-ended or stagger- 
ended termini for ligation, restriction enzyme digestion to provide for appropriate 
termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the 
fusion gene can be synthesized by conventional techniques including automated 

30 DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers which give rise to complementary overhangs 
between two consecutive gene fragments which can subsequently be annealed to 
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generate a chimeric gene sequence (see, for example, Current Protocols in 
Molecular Biology, eds. Ausubel et al., John Wiley & Sons: 1992). 



Table 2: Exemplary POSH nucleic acids 



Sequence Name 


Oreanism 


Accession Number 








cDNA FLJ11367 fis, clone 

HEMBA1 000303 

i 


Homo sapiens 


AK021429 


Plenty of SH3 domains 
(POSH) mRNA 


Mus musculus 


NM_021506 


Plenty of SH3s (POSH) 
mRNA 


Mus musculus 


AF030131 


Plenty of SH3s (POSH) 
mRNA 


Drosophila melanogaster 


NM_079052 


Plenty of SH3s (POSH) 
mRNA 


Drosophila melanogaster 


AF220364 


Table 3: Exemplary POSH polypeptides 


Sequence Name 


Oreanism 


Accession Number 








SH3 domains- 
containing protein POSH 


Mus musculus 


T09071 


plenty of SH3 domains 


Mus musculus 


NP_067481 


Plenty of SH3s; POSH 


Mus musculus 


AAC40070 


Plenty of SH3s 


Drosophila melanogaster 


AAF37265 


LD45365p 


Drosophila melanogaster 


AAK93408 


POSH gene product 


Drosophila melanogaster 


AAF57833 
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Plenty ofSH3s 



Drosophila melanogaster 



NP 523776 



In addition the following Tables provide the nucleic acid sequence and 
related SEQ ID NOs for domains of human POSH protein and a summary of POSH 
sequence identification numbers used in this application. 



Table 4. Nucleic Acid Sequences and related SEQ ID NOs for domains in human 
POSH 



Name of the 
sequence 


Sequence 


SEQ ID 
NO. 


RING domain 


TGTCCGGTGTGTCTA6AGCGCCTTGATGCTTCTGCGAAGGTCT 
TCjCCTTGCCAGCATACGTTTTGCAAGCGATGTTTGCT 

GGGGATCGTAGGTTCTCGAAATGAACTCAGATGTCCCGAGT 


31 


1 st SHy 

domain 


CCATGTGCCAAAGCGTTATACAACTATGAAGGAAAAGAGCCTG 
GAGACCTTAAATTCAGCAAAGGCGACATCATCATTTT 

GCGAAGACAAGTGGATGAAAATTGGTACCATGGGGAAGTCAAT 
GGAATCCATGGCTTTTTCCCCACCAACTTTGTGCAGA 

TTATT 


32 


2 nd SH 3 
domain 


CCTCAGTGCAAAGCACTTTATGACTTTGAAGTGAAAGACAAGG 
AAGCAGACAAAGATTGCCTTCCATTTGCAAAGGATGA 

TGTTCTGACTGTGATCCGAAGAGTGGATGAAAACTGGGCTGAA 
GGAATGCTGGCAGACAAAATAGGAATATTTCCAATTT 

CATATGTTGAGTTTAAC 


33 


3 rd SH 3 
domain 


AGTGTGTATGTTGCTATATATCCATACACTCCTCGGAAAGAGG 
ATGAACTAGAGCTGAGAAAAGGGGAGATGTTTTTAGT 

GTTTGAGCGCTGCCAGGATGGCTGGTTCAAAGGGACATCCATG 
CATACCAGCAAGATAGGGGTTTTCCCTGGCAATTATG 

TGGCACCAGTC 


34 
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4 th SH 3 


GAAAGGCACAGGGTGGTGGTTTCCTATCCTCCTCAGAGTGAGG 


35 


domain 


CAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTGT 






TCATAAAAAACGAGAGGATGGCTGGTTCAAAGGCACATTACAA 






CGTAATGGGAAAACTGGCCTTTTCCCAGGAAGCTTTG 






TGGAAAACA 





Table 5. Summary of POSH sequence Identification Numbers 



Sequence Information 


Sequence Identification Number 
(SEQ ID NO) 


Human POSH Coding Sequence 


SEQ ID No: 1 


Human POSH Amino Acid Sequence 


SEQ ID No: 2 


Human POSH cDNA Sequence 


SEQ ID No: 3 


5' cDNA Fragment of Human POSH j 


SEQ ID No: 4 


N- terminus Protein Fragment of 
Human . POSH 


SEQ ID NO: 5 


3' mRNA Fragment of Human POSH 


SEQ ID No: 6 


C- terminus Protein Fragment of 
Human POSH 


SEQ ID No: 7 


Mouse POSH mRNA Sequence 


SEQ ID No: 8 


Mouse POSH Protein Sequence 


SEQ ID No: 9 j 


Drosophila melanogaster POSH 
mRNA Sequence 


SEQ ID NO: 10 


Drosophila melanogaster POSH 
Protein Sequence 


SEQ ID No: 11 


Human POSH RING Domain Amino 
Acid Sequence 


SEQ ID No: ,26 


Human POSH 1 st SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 27 


Human POSH 2 nd SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 28 


Human POSH 3 rd SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 29 


Human POSH 4 tn SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 3 0 


Human POSH RING Domain Nucleic 
Acid Sequence 


SEQ ID NO: 31 
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tjt im an onc5H 1 st SH ■» Domain Nucleic 
Acid Sequence 


SEQ ID No: 32 


Human POSH 2 nd SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 33 


Human POSH 3 ra SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 34 


Human POSH 4 tn SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 35 



8. Exemplary Polypeptides 

In certain aspects, the present application relates to POSH polypeptides, 
which are isolated from, or otherwise substantially free of, other intracellular 

5 proteins which might normally be associated with the protein or a particular 
complex including the protein. In certain embodiments, POSH polypeptides have an 
amino acid sequence that is at least 60% identical to an amino acid sequence as set 
forth in any of SEQ ID Nos: 2, 5, 7, 9, 11, 26, 27, 28, 29 and 30. In other 
embodiments, the polypeptide has an amino acid sequence at least 65%, 70%, 75%, 

10 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% identical to an amino acid sequence 
as set forth in any of SEQ ID Nos: 2, 5, 7, 9, 11, 26, 27, 28, 29 and 30. 

In certain aspects, the application also relates to POSH-AP polypeptides 
(e.g., a POSH-AP provided in Table 7). Amino acid sequences of the POSH-APs 
listed in Table 7 are provided in Figure 36. Additional POSH-AP polypeptides are 

15 provided in Table 8. In certain embodiments, POSH-AP polypeptides have an 
amino acid sequence that is at least 60% identical to an amino acid sequence as set 
forth in Figure 36. In other embodiments, the POSH-AP polypeptide has an amino 
acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99% or 
100% identical to an amino acid sequence as set forth in Figure 36. 

20 Optionally, a POSH or POSH-AP polypeptide of the application will 

function in place of an endogenous POSH or POSH-AP polypeptide, for example by 
mitigating a partial or complete loss of function phenotype in a cell. For example, a 
POSH polypeptide of the application may be produced in a cell in which 
endogenous POSH has been reduced by RNAi, and the introduced POSH 

25 polypeptide will mitigate a phenotype resulting from the RNAi. An exemplary 
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15 



30 



POSH loss of function phenotype is a decrease in virus-like particle production in a 
cell transfected with a viral vector, optionally an HIV vector. In certain 
embodiments, a POSH polypeptide, when produced at an effective level in a cell, 
induces apoptosis. 

5 In another aspect, the application provides polypeptides that are agonists or 

antagonists of a POSH or POSH-AP polypeptide. Variants and fragments of a 
POSH or POSH-AP polypeptide may have a hyperactive or constitutive activity or 
alternatively, act to prevent POSH or POSH-AP polypeptides from performing one' 
or more functions. For example, a truncated form lacking one or more domain may 
10 have a dominant negative effect. 

Another aspect of the application relates to polypeptides derived from a full- 
length POSH or POSH-AP polypeptide. Isolatedpeptidylportionsofthesubject 
• protems can be obtained by screening polypeptides recombinant^ produced from 
the corresponding fragment of the nucleic acid encoding such polypeptides In 
add-on, fragments can be chemically synthesized using techniques known in the art 
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example 
any one of the subject proteins can be arbitrarily divided into fragments of desired 
length with no overlap of the fragments, or preferably divided into overlapping 
fragments of a desired length. The fragments can be produced (recombinant^ or by 
chemical synthesis) and tested to identify those peptidyl fr agme nts which can 
function as either agonists or antagonists of the formation of a specific protein 
complex, or more generally of a POSH.POSH-AP complex, such as by 
microinjection assays. 

It is also possible to modify the structure of the POSH or POSH-AP 
polypeptides for such purposes as enhancing therapeutic or prophylactic efficacy or 
stabmty (e.g., ex vivo shelf life and resistance to proteolytic degradation in vivo) 
Such mod,fied polypeptides, when designed to retain at least one activity of the 
naturally-occurring form of the protein, are considered functional equivalents of the 
POSH or POSH-AP polypeptides described in more detail herein. Such modified 
polypeptides can be produced, for instance, by amino acid substitution, deletion or 
addition. ' 
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For instance, it is reasonable to expect, for example, that an isolated 
replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, 
a threonine with a serine, or a similar replacement of an amino acid with a 
structurally related amino acid (i.e,. conservative mutations) will not have a major 
5 effect on the biological activity of the resulting molecule. Conservative replacements 
are those that take place within a family of amino acids that are related in their side 
chains. Genetically encoded amino acids are can be divided into four families (see, 
for example, Biochemistry, 2nd ed., Ed. by L. Stryer, W.H. Freeman and Co., 1 981). 
Whether a change in the amino acid sequence of a polypeptide results in a functional 
10 homolog can be readily determined by assessing the ability of the variant 
polypeptide to produce a response in cells in a fashion similar to the wild-type 
protein. For instance, such variant forms of a POSH polypeptide can be assessed, 
e.g., for their ability to bind to another polypeptide, e.g., another POSH polypeptide 
or another protein involved in viral maturation. Polypeptides in which more than 
1 5 one replacement has taken place can readily be tested in the same manner. 

This application further contemplates a method of generating sets of 
combinatorial mutants of the POSH or POSH-AP polypeptides, as well as truncation 
mutants, and is especially useful for identifying potential variant sequences (e.g., 
homologs) that are functional in binding to a POSH or POSH-AP polypeptide. The 
20 purpose of screening such combinatorial libraries is to generate, for example, POSH 
homologs which can act as either agonists or antagonist, or alternatively, which 
possess novel activities all together. CombinatoriaUy-derived homologs can be 
generated which have a selective potency relative to a naturally occurring POSH or 
POSH-AP polypeptide. Such proteins, when expressed from recombinant DNA 
25 constructs, can be used in gene therapy protocols. 

Likewise, mutagenesis can give rise to homologs which have intracellular 
half-lives dramatically different than the corresponding wild-type protein. For 
example, the altered protein can be rendered either more stable or less stable to 
proteolytic degradation or other cellular process which result in destruction of, or 
30 otherwise inactivation of the POSH or POSH-AP polypeptide of interest. Such 
homologs, and the genes which encode them, can be utilized to alter POSH or 
POSH-AP levels by modulating the half-life of the protein. For instance, a short 
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half-life can give rise to more transient biological effects and, when part of an 
inducible expression system, can allow tighter control of recombinant POSH or 
^ - POSH-AP levels within the cell. As above, such proteins, and particularly their 
recombinant nucleic acid constructs, can be used in gene therapy protocols. 

5 In similar fashion, POSH or POSH-AP homologs can be generated by the 

present combinatorial approach to act as antagonists, in that they are able tolnterfere 
with the ability of the corresponding wild-type protein to function. 

In a representative embodiment of this method, the amino acid sequences for 
a population of POSH or POSH-AP homologs are aligned, preferably to promote the 

10 highest homology possible. Such a population of variants can include, for example, 
homologs from one or more species, or homologs from the same species but which 
differ due to mutation. Amino acids which appear at each position of the aligned 
sequences are selected to create a degenerate set of combinatorial sequences. In a 
preferred embodiment, the combinatorial library is produced by way of a degenerate 

15 library of genes encoding a library of polypeptides which each include at least a 
portion of potential POSH or POSH-AP sequences. For instance, a mixture of 
synthetic oligonucleotides can be enzymatically ligated into gene sequences such 
that the degenerate set of potential POSH or POSH-AP nucleotide sequences are 
expressible as individual polypeptides, or alternatively, as a set of larger fusion 

20 proteins (e.g., for phage display). 

There are many ways by which the library of potential homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be carried out in an automatic DNA synthesizer, and 
the synthetic genes then be ligated into an appropriate gene for expression. The 

25 purpose of a degenerate set of genes is to provide, in one mixture, all of the 
sequences encoding the desired set of potential POSH or POSH-AP sequences. The 
synthesis of degenerate oligonucleotides is well known in the art (see for example, 
Narang, SA (1983) Tetrahedron 39:3; Itakura et al., (1981) Recombinant DNA, 
Proc. 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: 

30 Elsevier pp273-289; Itakura et al., (1984) Annu. Rev. Biochem. 53:323; Itakura et 
al., (1984) Science 198:1056; Dee et al., (1983) Nucleic Acid Res. 11:477). Such 
techniques have been employed in the directed evolution of other proteins (see, for 
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example, Scott et al., (1990) Science 249:386-390; Roberts et al., (1992) PNAS 
USA 89:2429-2433; Devlin et al., (1990) Science 249: 404-406; Cwirla et al., 
(1990) PNAS USA 87: 6378-6382; as well as U.S. Patent Nos: 5,223,409, 
5,198,346, and 5,096,815). 
5 Alternatively, other forms of mutagenesis can be utilized to generate a 

combinatorial 1 ibrary. F or e xample, P OSH o r P OSH-AP h omologs ( both a gonist 
and antagonist forms) can be generated and isolated from a library by screening 
using, for example, alanine scanning mutagenesis and the like (Ruf et al., ( 1994) 
Biochemistry 33:1565-1572; Wang et al., (1994) J. Biol. Chem. 269:3095-3099; 
10 Balintet al., (1993) Gene 137:109-1 18; Grodberg et al., (1993) Eur. J. Biochem. 
218:597-601; Nagashima et al., (1993) J. Biol. Chem. 268:2888-2892; Lowman et 
al., (1991) Biochemistry 30:10832-10838; and Cunningham et al., (1989) Science 
244:1081-1085), by linker scanning mutagenesis (Gustin et al., (1993) Virology 
193:653-660; Brown et al., (1992) Mol. Cell Biol. 12:2644-2652; McKnight et al., 
15 (1982) Science 232:316); by saturation mutagenesis (Meyers et al., (1986) Science 
232:613); by PCR mutagenesis (Leung et al., (1989) Method Cell Mol Biol 1:11- 
19); or by random mutagenesis, including chemical mutagenesis, etc. (Miller et al., 
(1992) A Short Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, NY; 
and Greener et al., (1994) Strategies in Mol Biol 7:32-34). Linker scanning 
20 mutagenesis, particularly in a combinatorial setting, is an attractive method for 
identifying truncated (bioactive) forms of POSH or POSH-AP polypeptides. 

A wide range of techniques are known in the art for screening gene products 
of combinatorial libraries made by point mutations and truncations, and, for that 
matter, for screening cDNA libraries for gene products having a c ertain p roperty. 
25 Such techniques will be generally adaptable for rapid screening of the gene libraries 
generated by the combinatorial mutagenesis of POSH or POSH-AP homologs. The 
most widely used techniques for screening large gene libraries typically comprises 
cloning the gene library into replicable expression vectors, transforming appropriate 
cells with the resulting library of vectors, and expressing the combinatorial genes 
30 under conditions in which detection of a desired activity facilitates relatively easy 
isolation of the vector encoding the gene whose product was detected. Each of the 
illustrative assays described below are amenable to high through-put analysis as 
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necessary to screen large numbers of degenerate sequences created by combinatorial 

mutagenesis techniques. 

In an illustrative embodiment of a screening assay, candidate combinatorial 
gene products of one of the subject proteins are displayed on the surface of a cell or 
5 virus, and the ability of particular cells or viral particles to bind a POSH or POSH- 
AP polypeptide is detected in a "panning assay". For instance, a library of POSH 
variants can be cloned into the gene for a surface membrane protein of a bacterial 
cell (Ladner et al.„ WO 88/06630; Fuchs et al., (1991) Bio/Technology 9:1370- 
1371; and Goward et al., (1992) TIBS 18:136-140), and the resulting fusion protein 
10 detected by panning, e.g., using a fluorescently labeled molecule which binds the 
POSH polypeptide, to score for potentially functional homologs. Cells can be 
visually inspected and separated under a fluorescence microscope, or, where the 
morphology of the cell permits, separated by a fluorescence-activated cell sorter. 

In similar fashion, the gene library can be expressed as a fusion protein on 
1 5 the surface of a viral particle. For instance, in the filamentous phage system, foreign 
peptide sequences can be expressed on the surface of infectious phage, thereby 
conferring two significant benefits. First, since these phage can be applied to 
affinity matrices at very high concentrations, a large number of phage can be 
screened at one time. Second, since each infectious phage displays the 
20 combinatorial gene product on its surface, if a particular phage is recovered from an 
affinity matrix in low yield, the phage can be amplified by another round of 
infection. The group of almost identical E. coli filamentous phages Ml 3, fd, and fl 
are most often used in phage display libraries, as either of the phage gill or gVHI 
coat proteins can be used to generate fusion proteins without disrupting the ultimate 
25 packaging of the viral particle (Ladner et al., PCT publication WO 90/02909; 
Garrard et al., PCT publication WO 92/09690; Marks et al., (1992) J. Biol. Chem. 
267:16007-16010; Griffiths et al., (1993) EMBO J. 12:725-734; Clackson et al., 
(1991) Nature 352:624-628; and Barbas et al., (1992) PNAS USA 89:4457-4461). 

The application also provides for reduction of the POSH or POSH-AP 
30 polypeptides to generate mimetics, e.g., peptide or non-peptide agents, which are 
able to mimic binding of the authentic protein to another cellular partner. Such 
mutagenic techniques as described above, as well as the thioredoxin system, are also 
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10 



15 



20 



25 



particularly useful for mapping the determinants of a POSH or POSH-AP 
polypeptide which participate in protein-protein interactions involved in for 
- example, binding of proteins involved in viral maturation to each other To 
illustrate, the critical residues of a POSH or POSH-AP polypeptide which are 
involved m molecular recognition of a substrate protein can be determined and used 
to generate its derivative peptidomimetics which bind to the substrate protein and 
by inhibiting POSH or POSH-AP binding, act to inhibit its biological activity By 
employing, for example, scanning mutagenesis to map the amino acid residues of a 
POSH polypeptide which are involved in binding to another polypeptide 
peptidomunetic compounds can be generated which mimic those residues involved 
m bmding For instance, non-hydrolyzable peptide analogs of such residues can be 
generated using benzodiazepine (e.g., see Freidinger et al., in Peptides: Chemistry 
and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988) 
azepine (e.g., see Huffman et al, in Peptides: Chemistry and Biology, G.R. Marshall' 
ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma lactam rings 
(Garvey et al., in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Pubhsher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et 
al., (1986) J. Med. Chem. 29:295; and Ewenson et al., in Peptides: Structure and 
Function (Proceedings of the 9th American Peptide Symposium) Pierce C hemical 
Co. Rockland, IL, 1985), b-turn dipeptide cores (Nagai et al., (1985) Tetrahedron 
Lett 26:647; and Sato et al., (1986) J Chem Soc Perkin Trans 1:1231) and b- 
ammoalcohols (Gordon et al., (1985) Biochem Biophys Res Commun 126:419; and 
Dann et al, (1986) Biochem Biophys Res Commun 134:71). 

The following table provides the sequences of the RING domain and the 
various SH3 domains of POSH. 



Table 6. Amino Acid Sequences and related SEQ ID NOs for domains 
POSH 



in human 



Name of 
the 

sequence 
RING 

9372369 f 
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domain 






1 st SH 3 
domain 


or 1 a wat vMvwrTrpprjnT.KFqKGDII ILRRQVDENWYHGEVNGIHGF 
FPTNFVQIIK 


27 


2 nd SH 3 
domain 


PQCKALYDFEVKDKEADKDCLPFAKDDVLTVI RRVDENWAEGMIAD 
KIGIFPISYVEFNS 


28 


3 rd SH 3 
domain 


SVYVAIYPYTPRKEDELELRKGEMFIiVFERCQDGWFKGTSMHTSKI 
GVFPGNYVAPVT 


29 


4 th SH 3 
domain 


ERHR VWS YPPQSE AELELKEGD I VFVH KKREDGWFKGTLQRNGKT 
GLFPGSFVENI 


30 



The following table provides a list of selected POSH-APs and their related 
SEQ ID NOs. 



' 5 Table 7 - Selected POSH APs 



Protein 


Protein Sequence 
(SEQ ID NO:) 


mRNA Sequence 
(SEQ ID NO:) 


ARF1 J 


223 


325-339 


ARF5 


224 


340-344 


ATP6V0C 


225-226 


345-351 


CBL-B 


361; 398; 227-230 


353-360 


CENTB1 


231-232 _J 


37-47 


DDEF1 1 


233-237 


48-54 


EIF3S3 


238 


55-57 


EPS8L2 


239 


58-60 


GOCAP1 


240-243 


61-68 


GOSR2 


244-248 


69-76 


HERPUD1 


249-252 


77-86 [ 


HLA-A 


253 


87-88 


HLA-B 


254 


89 


MSTP028 


255-256 


90-94 


PACS-1 


362-366 


95-100 


PPP1CA 


261-263; 395 


101-110 


PPJCAR1A 


264-265 


111-122; 396-397 


PTPN12 


266-268 


123-129 


RALA 


269-270 


130-134 


SIAH1 


271-272 


135-141 


SMN1 


273-275 


142-146 


SMN2 


276-280 


147-151 


SNX1 


281-286 


152-161 


SNX3 


287-290 


162-174 
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Jrrocein 


Protein Sequence 
(SEO ID NO:) 


mRNA Sequence 
(SEO ID NO:) 


SR A1 


291-294 


175-182 


SYNE1 


295-307 




TTC3 


308-312 


202-207 


UBE2N 


313 


208-210 


UNC84B ~ 


314 


211-213 


VCY2IP1 


315-323 


214-222 


SPG20 


386-388 


367-374 


WASF1 


389 


375-376 


HIP55 


390-394 


377-385 



10 



Table 8 below provides a list of POSH-APs that bound POSH in a 2-hybrid 
assay. Nucleic acid and amino acid sequences of the POSH-APs listed in Table 8 
were filed in a U.S. provisional application filed in the name of Daniel N. Taglicht, 
Iris Alroy, Yuval Reiss, Liora Yaar, Danny Ben-Avraham, Shmuel Tuvia, and 
Tsvika Greener entitled "Posh Interacting Proteins and Related Methods", filed on 
March 2, 2004 (Attorney Docket No. PROL-P79-024), which Provisional 
Application is incorporated herein by reference in its entirety. 

Table 8 -POSH-APs 



Protein and Variant 



Protein Sequence 
(public gi No.) 



mRNA Sequence 
(public gi No.) 



BCL9 - var 1 



BRD4 - var 1 



BRD4 - var 2 



4757846 



19718731 



7657218 



4757845 



19718730 



7657217 



DRP2 - var 1 



MAP1A - var 1 



4503393 
21536458 



4503392 



21536457 



SH2D2A - var 1 



BAT3 - var 1 



4503633 



18375630 



31543620 



18375633 



BAT3 - var 2 



18375634 



18375631 



BAT3 - var 3 



BCAR1 - var 1 



7656924 



18375629 



7656923 



AP-var 1 
EVPL-var 1 



4758120 



4758119 



4503613 



4503612 



FLJ13231 - var 1 
FL53657- var 1 



38604073 



38604072 



13376230 



13376229 



HSPC142 



- var 1 
LOCI 18987- var 1 



7661802 



7661801 



29789403 
2443367 



31341089 
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..Jrrotera and Variant 


Protein Sequence 
(public gi No.) 


mRNA Sequence 1 
(public gi No.) 


ivd/vtouu — var I 


24416002 


24416001 


| .a. i jr J Jhro — var i 


20070264 


20070263 1 


1 Rc 1 1 t;or 1 

1 ns. j i d — var i 


37546355 


37546354 


1 A "R t/ar 1 

1 /ior 1x5 — var i 


8922549 


8922548 j 


/v X r D/vi — var l 


4757810 


23346425 1 


I i^o or n i— var i 


9954875 


39725662 — ! 


or iou — var i 


24431997 


24431996 J 


y-'D x i — var l 


16418337 


19923847 J 


j v-'iv^ — var i 


16507208 


16507207 ! 


1 vvJ-jvz — var I 


4557477 


4557476 J 


v^i^rvZ — var z 


4502883 


4502882 1 


jjinmz — var 1 


4826700 


4826699 ] 


cisr i A I — var 1 


4503471 


25453469 J 


JDir4r/jDr 1 — var 1 


4758258 


20070179 


rJUi — var 1 


24432013 


39725704 


i vjv^zu ■ — var i 


5031711 


5031710 _J 


1 flT T TT wov 1 

vjj^uj^ — var 1 


19923206 


21361767 | 


xixioJrZ — var l 


7657603 


7657602 | 


1 1 VxD- var l 


4504779 


4504778 "1 


ij/viviA j — var I 


21264602 


21264601 1 


I T nPOftOQ^ .... 1 

I / — var i 


29734345 


29734344 J 


A/TDPT t*o^ 1 

iVjUvrijjo — var 1 


23111040 


20806105 . 1 


xis.^ouyjj — var 1 


30149441 


37550602 ' ] 


inv^uz — var 1 


4505417 


4505416 


1 rL/Dr i — var 1 


5453854 


14141164 1 


ri^fs I J. — var I 


22035674 


35493922 


jr vxlj — var i 


984325 


984324 1 


1 1? a pen «m*> i * 
1 j\nJrou — var i 


21361593 


21361592 | 


I ixiNxx — var 1 


21361547 


21361546 J 


1 PPT vor 1 
{ XVTL — VdT 1 


4506597 


15431291 


I xvTtjzu — var i 


4506697 


14591915 | 


Xvioz. / j\ — var l 


4506713 


27436941 


1 ox> i jlsd i — var i 


6912652 


6912651 


SrHA? — var 1 

I or j/v£ — var i 


21361376 


32189413 




11024714 


22538474 j 


ARHV — var 1 




20070359 | 


KIAA1 1 11 - var 1 


32698700 




ZNF147-varl 


4827065 f 


15208652 


PAWR-varl 


4505613 


4505612 


TPX2 - var 1 


20127519 


31542258 


HSPAlB-var 1 


4885431 


26787974 


DLG5 - var 1 


3043690 


3650451 


DLG5 - var 2 T 


28466997 


28466996 ~] 


DLG5 - var 3 


3650452 


6549841 | 



9372369J 
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Protein and Variant 



Protein Sequence 
(public gi No.) 



mRNA Sequence 
(public gi No.) 



DLG5 - var 4 



* 



16807129 



DLG5 - var 5 



22539637 



DLG5 - var 6 



* 



15929207 



DLG5 - var 7 



KIAA1598- var 1 



KIAA1598- var 2 



7023592 



3043689 



10047271 



7023591 



7018519 



KIAA1598- var3 



* 



21314680 



KIAA1598-var4 



KIAA1598-var5 



* 



10047270 



21755030 



KIAA1598-var6 



21755023 



KIAA1598-var7 



* 



21754670 



KIAA1598-var8 



21750902 



KIAA1598-var9 



KIAA1598-var 10 



KIAA1598-varll 



CGI-27 - var 1 



CGI-27-var2 



CGI-27 -var 3 
CGI-27 -var 4 



7705720 



21749984 



21749775 



21749737 



23270696 



22902234 



17046302 



16553689 



CGI-27 -var 5 



10433504 



CGI-27 -var 6 



CGI-27 -var 7 
var 1 



BIA2- 



BIA2 - var 2 



* 



5262640 



21591225 



4680692 



20127543 



5262639 



21591224 



BIA2 - var 3 



21755615 



COLIA1 - var 1 



COLIA1 - var 2 



COLIA1 - var 3 



COLIA1 -var 4 



COLIA1 - var 5 



180392 



180857 



1418928 



22328092 



762938 



407589 



30015 



30092 



7209641 



22328091 



COLIA1 - var 6 



COLIA1 - var 7 



COLIA1 -var8 



COLIA1 - var 9 



30016 



DKFZp761A052 - var 1 



407590 



1'0434104 



1418927 



180856 



180391 



14719826 



10434103 



DKFZp761A052-var2 



DKFZp761A052-var3 
DKFZp761A052-var4 



10439058 
14602829 



DKFZp761A052-var5 



20380411 
6808165 



10439057 



14602828 



15079884 



20380410 



DKFZp761A052 - var 6 



TLEl-varl 
TLE1 - var 2 



14603281 



TLE1 - var 3 
EGLN2 - var 1 



307510 



6808164 



16041735 



14603280 



307509 



8922130 



9372369 1 
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Protein and Variant 


Protein Sequence 
(public gi No.) 


mRNA Sequence 
(public gi No.) 


EGLN2 - var 2 


12804603 


10437903 


EGLN2 - var 3 


14547148 


21733075 


EGLN2 - var 4 


18031805 


21758140 


EGLN2 - var 5 


* 


18677002 


EGLN2 — var 6 




18031804 


EGLN2 - var 7 


* 


18141576 


EGLN2 - var 8 




14547147 


EGLN2 - var 9 


* 


12804602 


EGLN2- var 10 


* 


10439822 


EGLN2 - var 1 1 


* 


8922129 


STC2 - var 1 


3335144 


3335143 


STC2 - var 2 


* 


3702223 


STC2 - var 3 




4050037 


STC2 - var 4 


* 


4104014 


STC2 - var 5 


* 


13623494 


STC2 - var 6 


* 


14042507 


STC2 - var 7 


* 


14042032 I 


STC2-var8 


* 


21755241 


STC2 - var 9 


* 


21755207 


STC2-var 10 


* 


22761473 


STC2-var 11 


* 


12653744 


OPTN - var 1 


20149572 


16550123 


OPTN - var 2 


21619683 


3387890 


OPTN -var 3 


3329431 


3127082 


OPTN -var 4 


3127083 


3329430 


OPTN -var 5 


* 


21619682 


OPTN- var 6 


* 


18644681 


OPTN -var 7 


* 


18644683 


OPTN- var 8 




18644685 


OPTN - var 9 


* 


20149571 


FLJ37147-varl 


21753535 


21753534 


FLJ37147-var2 


30153743 


30153742 


KHDRBS1 - var 1 


21749696 


189499 


KHDRBS1 -var 2 


1841747 


12653852 


KHDRBS1 - var 3 


189500 


17512262 


KHDRBS1 - var 4 


* 


14714433 


KHDRBS1 - var 5 


* 


1841746 


KHDRBS1 - var 6 


* 


21749695 


SLC2Al-var 1 


3387905 


3387904 


SLC2Al-var2 


5730051 


5730050 


SLC2Al-var3 


14268550 


14268549 


DKFZp434B1231 - var 1 


6808117 


6808116 


NUMA1 - var 1 


27694103 


5453819 


NUMA1 - var 2 


35119 


13278785 



9372369J 
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...Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


NUMAl-var3 . . . . 


14249928 . ] 


14249927 


NUMA4 - var 4 


13278786 ! 


1 com ot/t 

15991876 


NUMA1 - var 5 


5453820 J 


zyol 18 


NUMA1 - var 6 


* 


29oll9 


NUMA1 - var 7 


* 


296120 ; 


NUMA1 - var 8 


* 


35118 


NUMA1 - var 9 


* 


20073234 


NUMAl-varlO 




22477305 


NUMA1 - var 1 1 


* 


22749583 


NUMA1 - var 12 


* 


27694102 


HSPC016-varl 


6841310 


12654536 j 


HSPC016-var2 


12654537 


6841309 


HSPC016-var3 


* 


4679017 


HSPC016-var4 




10834763 


UBC - var 1 


5912028 


3360475 J 


UBC - var 2 


340058 


2647407 


UBC -var 3 


340068 


24657521 


UBC - var 4 


14286308 


21751700 


UBC -var 5 


15928840 


21757163 


UBC - var 6 


16552475 


21758959 


UBC - var 7 


* 


16552474 | 


UBC - var 8 


* 


15928839 


UBC - var 9 


* 


14286307 | 


UBC -var 10 


* 


12653358 I 


UBC -var 11 




10439801 


UBC -var 12 


* 


340067 


UBC -var 13 




340057 


UBC -var 14 


* 


5912027 


ZFMl-var 1 


785999 


785998 


PIASY-var 1 


14603164 


3643110 


PIASY - var 2 


5533373 


5533372 


PIASY-var 3 


24850133 


1 /\ >1 O O OAO 

10433892 


PIASY - var 4 


3643111 


14603163 


PIASY - var 5 




20987516 


PIASY -var 6 




14709019 


XM 208944- var 1 


30153743 


OA1 f n >f O 

30153742 


J03930-var 1 


178442 


1 TO A A 1 

178441 


MTzA — var 1 


Io/jZo 


^7120 

J f L xWj 


MT2 A - var 2 


37121 


263506 


MT2 A - var 3 




13937856 


MT2A- var 4 


* 


1495465 


MT2 A - var 5 


* 


187527 


EWSRl-var 1 


7669490 


21734132 


EWSR1 -var2 


12653511 


547565 



9372369_1 
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Protein and Variant 


Protein Sequence | 
(public £i No.) 


— TVT A Cannon/*^ 

IMCJNA. oequence 
ipuDiic gi n u.j _j 


EWSR1 - var 3 


15029675 


Zl /DOJDO 


EWSR1 - var 4 


16552153 


10 / J 


EWSR1 - var 5 


16551674 1 


10DDZI3Z 


EWSR1 - var 6 


31280 1 


1 ^HOO^IA 

i juzyo /4 


EWSR1 - var 7 


* 1 




EWSR1 - var 8 


* | 


1U 


EWSR1 - var 9 


* j 


1 flAIOfWl 

iU4jyu/^ _j 


EWSRl-varlO 


* 1 




MADH6 - var 1 


2828712 


1CO4jZ0 _j 


MADH6 — var 2 


2736316 


ZUj /iOU4 


MADH6 - var 3 


1654327 1 


Z/3o31-> 


MADH6 — var 4 


* 1 


OCOQ71 1 
ZoZo /ll 


MADH6 - var 5 


* 


1 

iDZ/oUSy 


THOC2 - var 1 


20799318 


i a /i o c/C /i o 


THOC2-var2 


10435650 J 


zu /yyi I / 


THOC2 - var 3 


* 


70Z3ZZ4 


ZNF151 - varl 


676873 


ZZJUo /U 


ZNF151 - var2 


2230871 


0/00 /z 


DDX31 - var 1 


10435700 


1 /I A /I O 1 

14U4Ziy3 


DDX31-var2 


10440004 


15Z1DZ/Z 


DDX31-var3 


20336298 


10DOOj4y 


DDX31-var4 


16566550 


O A"2 1 /CO 0*7 


DDX31-var5 


15215273 


ZUiioZyo 


DDX31-var6 


14042194 


1 A/I /! AAA*2 


DDX31 - var 7 


* 


1 A/1 *3 C/CftO 


POLR2J2 - var 1 


11595478 


217U4Z/1 


POLR2J2 - var 2 


21704274 


O 1 ^7 A/1 T7A 

217U4Z/U 


POLR2J2 - var 3 


19401711 


1 Q/1 A1 T1 A 
| 1 94U 1 / 1 U j 


POLR2J2 - var 4 


14702175 


1 O 1 ^7A/10*71 

Zl /U4Z/3 


POLR2J2 - var 5 


21704272 


| IO0/0U0J 


POLR2J2 - var 6 


* 


| 1 1 DyD4 / 3 


POLR2J2 - var 7 


* 


j 1 l!>yD4/ / t 


POLR2J2 - var 8 




1 1 S.Q^AH'X 
j llDyD4/j 


BANF1 - var 1 


«t /-\ f-\ r~\ r~ 1 

300295 l 


1 1 O^B^A^ 
1 1 lUjoO'fj 


BANF1 - var 2 


4502389 


| 13D43D/0 


BANP1 - var 3 




| 14/l->yU/ 


BANF1 - var 4 




| jUUZIOU 


BANF1 - var 5 




I 43Ziy/D 


BANF1 - var 6 


* 


3220254 


PDV/I var 1 

K^DAst — Vol I 


1945453 


1945452 


CBX4-var2 


15929016 


2317722 


CBX4-var3 


2317723 


15929015 


ARJH2 - var 1 


3925604 


3925603 


ARIH2 - var 2 


9963793 


I 3930777 


9372369_1 
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Protein and Variant 


X t UlCliJ LjCUUClltC 

( mi hi if* <j\ Mo ^ 


mRNA Sequence 
(public gi No.) 


ARIH2 - var 3 


12653307 

*■ \J —> ~J \J 1 




ARIH2 - var 4 


* 




ARJH2 - var 5 






AJRJH2 - var 6 


* 




ARIH2 - var 7 


* 


Ooo!>oU2 


ARIH2 - var 8 




Z1/49DO j 


ARIH2 - var 9 


* 


JJo/j424 J 


ARIH2 - var 10 






ARIH2 - var 1 1 






ARIH2 - var 12 




< 0 1 *7 1 AA 1 

JoI/lUO 1 


1 ARIH2 - var 13 






SRPK2 - var 1 


1 857944 


21752284 


SRPK2 - var 2 


23270876 


21749007 


SRPK2-var3 


* 


ZJ270875 


SRPK2 - var 4 




1 OD /943 | 


SIAH2 - var 1 


2673968 


10D4S/991 


1 SIAH2 - var 2 


2664283 


34189635 1 


SIAH2-var3 


* 


ZDD4282 J 


SIAH2-var4 


* 


Zo/39o7 | 


KIAA0191 - var 1 


27480017 


ZyJis/Zol 1 


KIAA0191-var2 j 


1228035 


1U43o3UU 


KIAA0191 - var 3 


29387262 


i oodao a i 
IzzoU34 


KIAA0191 -var4 


* 


Once act i 

Zl /D5057 


KIAA0191-var5 




1*7/1 QAA1 £. t 


KIAA0191 - var6 




19387907 


KIAA0191 - var? 


* 


Oo3do51 


KIAA0191 - var 8 






PAl-RBPl-varl 


5262551 


Zz/oU/ol 


PA1-RBP1 - var 2 


4929579 


ZUU / Z47 / | 


PA1-RBP1 - var 3 


12804377 


1 7Q70/1 C< 1 

1 /5oy4jo 


PAl-RBPl-var4 


12803339 l 


I ol/ooZ4J I 


PAl-RBPl-var5 


14029171 




PAl-RBPl-var6 


18088244 




PA1-RBP1 - var 7 


22760762 


1 *f 1 / U 


PA1-RBP1 - var 8 


* 


jjo/O/h-17 


PA1-RBP1 - var 9 


* 




PA1-RBP1 - var 10 


* 


4929578 ~J 


1 PA1-RBP1 - var 11 


* 


4406639 1 


PAl-RBPl-varl2 




5262550 ~~J 


FAT - var 1 


2281025 


1107686 1 


FAT - var 2 


1107687 


15214611 | 


FAT - var 3 


* 


2281024 1 


FAT -var 4 


* 


598748 1 


VOL- var 1 


24657579 


7669551 



9372369J 
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.. Protein and Variant 


Protein Sequence 
(public ei No.) 


mRNA Sequence 
(public gi No.) 


VCL-var2 ! 


340237 1 


7669549 


VCL — var 3 


7669550 


340236 


VCL-var4 


* 


21732673 


VCL-var5 


* ! 


15426616 


VCL-var6 


* 


246657578 


SSR4 - var 1 


15929882 


30583222 


SSR4-var2 


13097213 


1071680 


SSR4-var 3 


* 


22749791 


SSR4 - var 4 


* 


21753447 


SSR4-var 5 


* 


16552704 


SSR4-var6 


* 


15929881 


SSR4 - var 7 


* 


13097212 


SSR4-var 8 


* 


2398656 


PRDX5 - var 1 


6166493 


27484966 


PRDX5 - var 2 


6746355 


9802047 


PRDX5 - var 3 


9802048 


8745393 


PRDX5 - var 4 


27484967 


6746354 


PRDX5 - var 5 


* 


6563211 


PRDX5 - var 6 




6103723 


PRDX5 - var 7 


* 


6166492 


PRDX5 - var 8 


* 


6523288 


PRDX5 - var 9 




32455258 


FLJ10120-var 1 


8922239 


27469671 


FLJ10120-var2 


* 


8922238 J 


PROL4 - var 1 


22208536 


22208535 


PROL4 - var 2 


6005802 


1050982 


CL25084 - var 1 


15341891 


4406555 


CL25084 - var 2 


7023472 


4406692 ! 


CL25084 - var 3 


4406693 


7023471 | 


CL25084-var4 


4406556 


15341890 


Cllorfl7-var 1 


22761313 


21361869 


Cllorfl7- var2 


21105773 


20149226 


Cllorfl7- var 3 


20149225 


20149224 


Cllorfl7- var4 


20149227 


21105772 


Cllorfl7- var 5 


21361870 


21410957 


Cllorfl7- var6 


* 


22761312 


POLQ-var 1 


3510695 


13892060 


POLQ - var 2 


4163931 


13892060 


POLQ - var 3 


13892061 


4163930 


POLQ - var 4 




3510694 


MBD2 - var 1 


3170202 


3800812 


MBD2 - var 2 


3800801 


5817231 


MBD2 - var 3 


7710145 


21595775 


MBD2 - var 4 


21595776 


21464120 
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Protein and Variant 


Protein Sequence 
(public gi No.) 


mRNA Sequence 
(public gi No.) 


MBD2 - var 5. 


* 


21464121 


MBD2 - var 6 


* 


3800800 


MBD2 - var 7 


* 


3800792 


MBD2 - var 8 


* 


3170201 J 


FSTL1 - var 1 


12658309 


536897 


FSTLl-var2 


12652619 


16924272 


FSTLl-var3 


* 


33990756 


FSTL1 - var 4 


* 


12658308 


FSTL1 - var 5 


* 


10438502 


FSTL1 - var 6 


* 


4884472 



* denotes a polypeptide sequence that can be deduced from the corresponding 
mRNA sequence. 

5 

9. Effective Dose 

Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
for determining The LD50 (the dose lethal to 50% of the population) and the ED50 

10 (the dose therapeutically effective in 50% of the population). The dose ratio 
between toxic and therapeutic effects is the therapeutic index and it can be expressed 
as the ratio LD50/ED50. Compounds which exhibit large therapeutic induces are 
preferred. While compounds that exhibit toxic side effects may be used, care should 
be taken to design a delivery system that targets such compounds to the site of 

15 affected tissue in order to minimize potential damage to uninfected cells and, 
thereby, reduce side effects. • 

The data obtained from the cell culture assays and animal studies can be used 
in formulating a range of dosage for use in humans. The dosage of such compounds 
lies p referably w ithin a range o f c irculating c oncentrations t hat i nclude t he E D50 

20 with little or no toxicity. The dosage may vary within this range depending upon the 
dosage form employed and the route of administration utilized. For any compound 
used in the method of the application, the therapeutically effective dose can be 
estimated initially from cell culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration range that includes the IC50 

25 (i.e., the concentration of the test compound which achieves a half-maximal 
9372369_1 
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inhibition of symptoms) as detennined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. Levels in plasma may be 
measured, for example, by high performance liquid chromatography. 

5 10. Formulation and Use 

Pharmaceutical compositions for use in accordance with the present 
application may be formulated in conventional manner using one or more 
physiologically acceptable carriers or excipients. Thus, the compounds and their 
physiologically acceptable salts and solvates may be formulated for administration 
10 by, for example, injection, inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

An exemplary composition of the application comprises an RNAi mixed 
with a delivery system, such as a liposome system, and optionally including an 
acceptable excipient. In a preferred embodiment, the composition is formulated for 
15 topical administration for, e.g., herpes virus infections. 

For such therapy, the compounds of the application can be formulated for a 
variety of loads of administration, including systemic and topical or localized 
administration. Techniques and formulations generally may be found in 
Remmington's P harmaceutical S ciences, M eade P ublishing C o., E aston, PA. F or 
20 systemic administration, injection is preferred, including intramuscular, intravenous, 
intraperitoneal, and subcutaneous. For injection, the compounds of the application 
can be formulated in liquid solutions, preferably in physiologically compatible 
buffers such as Hank's solution or Ringer's solution. In addition, the compounds 
may be formulated in solid form and redissolved or suspended immediately prior to 
25 use. Lyophilized forms are also included. 

For oral administration, the pharmaceutical compositions may take the form 
of, for example, tablets or capsules prepared by conventional means with 
pharmaceutically acceptable excipients such as binding agents (e.g., pregelatinised 
maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., 
30 lactose, microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., 
magnesium stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). The tablets may be 
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coated by methods well known in the art. Liquid preparations for oral 
administration may take the form of, for example, solutions, syrups or suspensions, 
^ - or they- may be presented as a dry product for constitution with water or other 
suitable vehicle before use. Such liquid preparations may be prepared by 
5 conventional means with pharmaceutically acceptable additives such as suspending 
agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., ationd oil, 
oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., 
methyl orpropyl-p-hydroxybenzoates or sorbic acid). T he preparations may also 
1 0 contain buffer salts, flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give 
controlled release of the active compound. For buccal administration the 
compositions may take the form of tablets or lozenges formulated in conventional 
manner. For administration by inhalation, the compounds for use according to the 
15 present application are conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to 
20 deliver a metered amount. Capsules and cartridges of e.g., gelatin for use in an 
inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for injection 
25 may be presented in unit dosage form, e.g., in ampoules or in multi-dose containers, 
with an added preservative. The compositions may take such forms as suspensions, 
solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the 
active i ngredient m ay b e i n p owder form for c onstitution w ith a s uitable v ehicle, 
30 e.g., sterile pyrogen-firee water, before use. 
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The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional s uppository b ases 
such as cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may 
5 also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with 
suitable polymeric or hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for 
1 0 example, as a sparingly soluble salt. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration bile salts and 
1 5 fusidic acid derivatives, in addition, detergents may be used to facilitate permeation. 
Transmucosal administration may be through nasal sprays or using suppositories. 
For topical administration, the oligomers of the application are formulated into 
ointments, salves, gels, or creams as generally known in the art. A wash solution 
can be used locally to treat an injury or inflammation to accelerate healing. 
20 The compositions may, if desired, be presented in a pack or dispenser device 

which may contain one or more unit dosage forms containing the active ingredient. 
The pack may for example comprise metal or plastic foil, such as a blister pack. The 
pack or dispenser device may be accompanied by instructions for administration. 

For therapies involving the administration of nucleic acids, the oligomers of 
25 the application can be formulated for a variety of modes of administration, including 
systemic and topical or localized administration. Techniques and formulations 
generally may be found in Remmington's Pharmaceutical Sciences, Meade 
Publishing Co., Easton, PA. For systemic administration, injection is preferred, 
including intramuscular, intravenous, intraperitoneal, intranodal, and subcutaneous 
30 for injection, the oligomers of the application can be formulated in liquid solutions, 
preferably in physiologically compatible buffers such as Hank's solution or Ringer's 
solution. In addition, the oligomers may be formulated in solid form and 
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redissolved or suspended immediately prior to use. Lyophilized forms are also 
included. 

Systemic administration can also be by transmucosal or transdermal means, 
or the compounds can be administered orally. For transmucosal or transdermal 
5 administration, penetrants appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art, and include, for 
example, for transmucosal administration bile salts and fusidic acid derivatives. In 
addition, detergents may be used to facilitate permeation. Transmucosal 
administration may be through nasal sprays or using suppositories. For oral 
10 administration, the oligomers are formulated into conventional oral administration 
forms such as capsules, tablets, and tonics. For topical administration, the oligomers 
of the application are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

15 The application now being generally described, it will be more readily 

understood by reference to the following examples, which are included merely for 
purposes of illustration of certain aspects and embodiments of the present 
application, and are not intended to limit the application. 

20 EXAMPLES 

Example 1. Role of POSH in virus-like particle (VLP^ budding 
1. Objective: 

Use RNAi to inhibit POSH gene expression and compare the efficiency of 
viral budding and GAG expression and processing in treated and untreated cells. 

25 2. Study Plan: 

HeLa SS-6 cells are transfected with mRNA-specific RNAi in order to 
knockdown the target proteins. Since maximal reduction of target protein by RNAi 
is achieved after 48 hours, cells are transfected twice - first to reduce target mRNAs, 
and subsequently to express the viral Gag protein. The second transfection is 

30 performed with pNLenv (plasmid that encodes HIV) and with low amounts of RNAi 
to maintain the knockdown of target protein during the time of gag expression and 
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budding of VLPs. Reduction in mRNA levels due to RNAi effect is verified by RT- 
PCR amplification of target mRNA. 
3. Methods, Materials, Solutions 

a. Methods 

i. Transfections according to manufacturer's protocol and as described in 
procedure. 

ii. Protein determined by Bradford assay. 

iii. SDS-PAGE in Hoeffer miniVE electrophoresis system. Transfer in Bio- 
Rad mini-protean II wet transfer system. Blots visualized using Typhoon system, 
and ImageQuant software (ABbiotech) 

b. Materials 



Material 


Manufacturer 


Catalog # 


Batch # 


Lipofectamine 2000 
(LF2000) 


Life Technologies 


11668-019 


1112496 


OptiMEM 


Life Technologies 


31985-047 


3063119 


RNAi Lamin A/C 


Self 


13 




RNAi TSG101 688 


Self 


65 




RNAi Posh 524 


Self 


81 




plenvll PTAP 


Self 


148 




plenvll ATAP 


Self 


149 




Anti-p24 polyclonal 
antibody 


Seramun 




A-0236/5- 
10-01 


Anti-Rabbit Cy5 
conjugated antibody 


Jackson 


144-175-115 


48715 


10% acrylamide Tris- 
Glycine SDS-PAGE gel 


Life Technologies 


NP0321 


1081371 


Nitrocellulose 
membrane 


Schleicher & 
Schuell 


401353 


BA-83 


NuPAGE 20X transfer 
buffer 


Life Technologies 


NP0006-1 


224365 


0.45 urn filter 


Schleicher & 


10462100 


CS 101 8-1 
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1 


Schuell . 







c. Solutions 



Lysis Buffer 


Compound 


Concentration 


Tris-HCl pH 7.6 


50mM 


MgCl 2 


15mM 


NaCl 


150mM 


Glycerol 


10% 


EDTA ; 


ImM 


EGTA 


ImM 




ASB-14 (add immediately 
before use) 


1% 


6X Sample 
Buffer 


Tris-HCl, pH=6.8 


1M 


Glycerol 


30% 


SDS 


10% 


DTT 


9.3% 


Bromophenol Blue 


0.012% 


TBS-T 


Tris pH=7.6 


20mM 


NaCl 


137mM 


Tween-20 


0.1% 



4. Procedure 
5 a. Schedule 



Day ' 


1 


2 


3 


4 


5 


Plate 


Transfection 


Passage 


Transfection II 


Extract RNA 


cells 


I 


cells 


(RNAi and 


for RT-PCR 




(RNAi only) 


(1:3) 


pNlenv) 


(post 








(12:00, PM) 


transfection) 
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Extract RNA for 

RT-PCR 

(pre-transfection) 


Harvest VLPs 
and cells 



b. Dayl 

Plate HeLa SS-6 cells in 6-well plates (35mm wells) at concentration of 5 XI 0" 
cells/well. 
5 c. Day 2 

2 hours before transfection replace growth medium with 2 ml growth medium 
without antibiotics. 

Transfection I: 











RNAi [nM] 


RNAi 
[20uMJ 


A B 

OPtiMEM LF2000 


Reaction 


RNAi name 


TAGDA# Reactions 




Ml 


(MD 


(Ml) 


1 


Lamin A/C 


13 


2 


50 


12.5 


500 


500 


2 


Lamin A/C 


13 


1 


50 


6.25 


250 


250 


3 


TSG101 688 


65 


2 


20 


5 


500 


500 


5 


Posh 524 


81 


2 


50 


12.5 


500 


500 



10 Transfections: 

Prepare LF2000 mix: 250 \xl OptiMEM + 5 |nl LF2000 for eacty reaction. Mix by 
inversion, 5 times. Incubate 5 minutes at room temperature. 
Prepare RNA dilution in OptiMEM (Table 1, column A). Add LF2000 mix 
dropwise to diluted RNA (Table 1 , column B). Mix by gentle vortex. Incubate at 

1 5 room temperature 25 minutes, covered with aluminum foil. 

Add 500 nl transfection mixture to cells dropwise and mix by rocking side to 
side. 

Incubate overnight. 

d. Day 3 

20 Split 1 :3 after 24 hours. (Plate 4 wells for each reaction, except reaction 2 which 

is plated into 3 wells.) 

e. Day 4 
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-. 2 hours pre-transfection replace medium with DMEM growth medium without 
antibiotics. 

. i 

Transfection II 











A 


B 


c 


D 












RNAi 














Plasmid 


[20uMJ for 






RNAi 


TAG 




Reaction for 2.4 ug 


10nM 


OPtiMEM 


LF2000 mix 


name 


DA# 


Plasmid 


# 


(Ml) 


(Ml) 


(Pi) 


(Ml) 


Lamin 








3.4 








A/C 


13 


PTAP 


3 




3.75 


750 


750 


Lamin 








2.5 








A/C 


13 


ATAP 


3 




3.75 


750 


750 


TSG101 








3.4 








688 


65 


PTAP 


3 




3.75 


750 


750 


Posh 524 


81 


PTAP 


3 


3.4 


3.75 


750 


750 



5 Prepare LF2000 mix: 250 ul OptiMEM + 5 ul LF2000 for each reaction. Mix by 

inversion, 5 times. Incubate 5 minutes at room temperature. 
Prepare RNA+DNA diluted in OptiMEM (Transfection II, A+B+C) 
Add LF2000 mix (Transfection II, D) to diluted RNA+DNA dropwise, mix by 
gentle vortex, and incubate lh while protected from light with aluminum foil. 
10 Add LF2000 and DNA+RNA to cells, 500ul/well, mix by gentle rocking and 

incubate overnight. 

f. Day 5 

Collect samples for VLP assay (approximately 24 hours post-transfection) by the 
following procedure (cells from one well from each sample is taken for RNA 
15 assay, by RT-PCR). 

g. Cell Extracts 

i. Pellet floating cells by centrifugation (5min, 3000 rpm at 4 °C), save 
supernatant (continue with supernatant immediately to step h), scrape 
remaining cells in the medium which remains in the well, add to the 
20 corresponding floating cell pellet and centrifuge for 5 minutes, 1 800rpm at 



4°C. 
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ii. Wash cell pellet twice with ice-cold PBS. 

iii. Resuspend cell pellet in 100 (xl lysis buffer and incubate 20 minutes on 
ice. 

iv. Centrifuge at 14,000 rpm for 15 min. Transfer supernatant to a clean 
5 tube. This is the cell extract. 

v. Prepare 10 of cell extract samples for SDS-PAGE by adding SDS- 
PAGE sample buffer to IX, and boiling for 10 minutes. Remove an aliquot 
of the remaining sample for protein determination to verify total initial 
starting material. Save remaining cell extract at -80 °C. 

10 h. Purification of VLPs from cell media 

i. Filter the supernatant from step g through a 0.45m filter. 

ii. Centrifuge supernatant at 14,000 rpm at 4 °C for at least 2 h. 

iii. Aspirate supernatant carefully. 

iv. Re-suspend VLP pellet in hot (100 °C warmed for 10 min at least) IX 
15 sample buffer. 

v. Boil samples for 10 minutes, 100 °C. 
i. Western Blot analysis 

i. Run all samples from stages A and B on Tris-Glycine SDS-PAGE 10% 
(120V for 1.5 h). 

20 ii. Transfer samples to nitrocellulose membrane (65V for 1 .5 h). 

iii. Stain membrane with ponceau S solution. 

iv. Block with 10% low fat milk in TBS-T for 1 h. 

v. Incubate with anti p24 rabbit 1 :500 in TBS-T o/n. 

vi. Wash 3 times with TBS-T for 7 min each wash. 

25 vii. Incubate with secondary antibody anti rabbit cy5 1 :500 for 30 min. 

viii. Wash five times for 10 min in TBS-T. 

ix. View in Typhoon gel imaging system (Molecular Dynamics/APBiotech) 
for fluorescence signal. 

Results are shown in Figures 11-13. 

30 

Example 2. Exemplary POSH RT-PCR primers and siRNA duplexes 
RT-PCR primers 
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Name 


Position 


Sequence 


Sense primer 


POSH=271 


271 


5* CTTGCCTTGCCAGCATAC 3' (SEQIDNO:12) 


Anti-sense 
primer 


POSH=926c 


926C 


5' CTGCCAGCATTCCTTCAG 3' (SEQIDNO:13) 



siRNA duplexes: 

siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



153 

POSH-230 
426-446 

5' AACAGAGGCCTTGGAAACCTG 3' 

5' dTdTCAGAGGCCUUGGAAACCUG 3* 

SMTdTCAGGUUUCCAAGGCCUCUG 3' 



SEQ ID NO: 14 
SEQ ID NO: 15 
SEQ ID NO: 16 



10 siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 

1 5 siRNA anti -sense strand: 



155 

POSH-442 
638-658 

5' AAAGAGCCTGGAGACCTTAAA 3' 
5' ddTdTAGAGCCUGGAGACCUIJAAA 3' SEQ ID NO: 18 
5' ddTdTUUUAAGGUCUCCAGGCUCU3* SEQ ID NO: 19 



SEQ ID NO: 17 



20 



siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



157 

POSH-U111 
2973-2993 

5' AAGGATTGGTATGTGACTCTG 3' 

5' dTdTGGAUUGGUAUGUGACUCUG 3' 

5' dTdTCAGAGUCACAUACCAAUCC 3' 



SEQ ID NO: 20 
SEQ ID NO: 21 
SEQ ID NO: 22 



siRNA No: 
25 siRNA Name: 

Position in mRNA 
Target sequence: 
siRNA sense strand: 



159 

POSH-U410 
3272-3292 

5' AAGCTGGATTATCTCCTGTTG 3' 
5' ddTdTGCUGGAUUAUCUCCUGUUG 3' 



SEQ ID NO: 23 
SEQ ID NO: 24 
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siRNA anti-sense strand: 5' ddTdTCAACAGGAG AUAAUCCAGC 3' SEQ ID NO: 25 

187 

POSH-control 
None. Reverse to #153 

5' AAGTCCAAAGGTTCCGGAGAC 3' "SEQ ID 

3. Knock-down of hPOSH entraps HIV virus particles in intrace llular vesicles. 
10 HIV virus release was analyzed by electron microscopy following siRNA 

and full-length HIV plasmid (missing the envelope coding region) transfection. 
Mature viruses were secreted by cells transfected with HIV plasmid and non- 
relevant siRNA (control, lower panel). Knockdown of TsglOl protein resulted in a 
budding defect, the viruses that were released had an immature phenotype (upper 
15 panel). Knockdown of hPOSH levels resulted in accumulation of viruses inside the 
cell in intracellular vesicles (middle panel). Results, shown in Figure 28, indicate 
that inhibiting hPOSH entraps HIV virus particles in intracellular vesicles. As 
accumulation of HTV virus particles in the cells accelerate cell death, inhibition of 
hPOSH therefore destroys HIV reservoir by killing cells infected with HIV. 

20 

Example 4. In-vitro assav of Human POSH sel f-ubiquitination 

Recombinant hPOSH was incubated with ATP in the presence of El, E2 and 
ubiquitin as indicated in each lane. Following incubation at 37 °C for 30 minutes, 
25 reactions w ere t erminated b y a ddition o f S DS-PAGE s ample b uffer. T he s amples 
were subsequently resolved on a 10% polyacrylamide gel. The separated samples 
were then transferred to nitrocellulose and subjected to immunoblot analysis with an 
anti ubiquitin polyclonal antibody. The position of migration of molecular weight 
markers is indicated on the right. 
30 Poly-Ub: Ub-hPOSHconjugates, detected as high molecular weight adducts only in 
reactions containing El, E2 and ubiquitin. hPOSH-176 and hPOSH-178 are a short 



^ - siRNA-No.: 
siRNA Name: 

5 Position in mRNA: 
Target sequence: 
NO: 36 
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10 



and a longer derivatives (respectively) of bacterially expressed hPOSH; C, control 
E3. 

Preliminary steps in a high-throughput screen 
Materials 

1. El recombinant from bacculovirus 

2. E2 Ubch5c from bacteria 

3. Ubiquitin 

4. POSH #178 (1-361) GST fusion-purified but degraded 

5. POSH # 176 (1-269) GST fusion-purified but degraded 

6. hsHKDl soluble ring containing region 

5. Bufferxl2 (Tris 7.6 40 mM, DTT ImM, MgCl 2 5mM, ATP 2uM) 

6. Dilution buffer (Tris 7.6 40mM, DTT ImM, ovalbumin lug/ul) 
protocol 





0. lug/ul 


0.5ug/ul 


5ug/ul 


0.4ug/ul 


2.5ug/u/ 


0.8ug/ul 






El 


E2 


Ub 


176 


178 


Hrdl 


Bxl2 


-El (E2+176) 




0.5 


0.5 








10 


-E2 (El+176) 


1 




0.5 








9.5 


-ub (E1+E2+176) 


1 


0.5 










9.5 


El+E2+176+Ub 


1 


0.5 


0.5 








9 I 


-El (E2+178) 




0.5 


0.5 




1 




10 


-E2 (El+178) 


1 




0.5 




1 




9.5 


-ub (E1+E2+178) 


1 


0.5 






1 




9.5 


El+E2+178+Ub 


1 


0.5 


0.5 




1 


1 


9 


Hrdl, El+E2+Ub 


1 


0.5 


0.5 






8.5 



15 



20 



1. Incubate for 30 minutes at 37 °C. 

2. Run 12% SDS PAGE gel and transfer to nitrocellulose membrane 

3. Incubate with anti-Ubiquitin antibody. 

Results, shown in Figure 19, demonstrate that human POSH has 
ubiquitin ligase activity. 
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Exam ple 5. Co-immunoprecipitation of hPOSH with mvc-tagge d activated (VI 2) 
and dominant-negative (N17) Racl 

HeLa cells were transfected with combinations of myc-Racl V12 or N17 and 
hPOSHdelRING-V5. 24 hours after transfection (efficiency 80% as measured by 

5 GFP) cells were collected, washed with PBS, and swollen in hypotonic lysis buffer 
(10 mM HEPES pH=7.9, 15 mM KC1, 0.1 mM EDTA, 2 mM MgC12, 1 mM DTT, 
and protease inhibitors). Cells were lysed by 10 strokes with dounce homogenizer 
and centrifuged 3000xg for 10 minutes to give supernatant (Fraction 1) and nucleii. 
Nucleii were washed with Fraction 2 buffer (0.2% NP-40, 10 mM HEPES pH=7.9, 

10 40 mM KC1, 5% glycerol) to remove peripheral proteins. Nucleii were spun-down 
and supernatant collected (Fraction 2). Nuclear proteins were eluted in Fraction 3 
buffer (20 mM HEPES pH=7.9, 0.42 M KC1, 25% glycerol, 0.1 mM EDTA, 2 mM 
MgCl 2 , 1 mM DTT) by rotating 30 minutes in cold. Insoluble proteins were spun- 
down 14000xg and solubilized in Fraction 4 buffer (1% Fos-Choline 14, 50 mM 

15 HEPES pH=7.9,- 150 mM NaCl, 10% glycerol, ImM EDTA, 1.5 mM MgCl 2 , 2 mM 
DTT). Half of the total extract was pre-cleared against Protein A sepharose for 1.5 
hours and used for IP with 1 ug anti-myc (9E10, Roche 1-667-149) and Protein A 
sepharose for 2 hours. Immune complexes were washed extensively, and eluted in 
SDS-PAGE sample buffer. Gels were run, and proteins electro-transferred to 

20 nitrocellulose for immunoblot as in Figure 20. Endogenous POSH and transfected 
hPOSHdelRING-V5 are precipitated as a complex with Myc-Racl V12/N17. 
Results, s hown i n F igure 2 0, d emonstrate t hat POSH c o-immunoprecipitates w ith 
Racl. 

25 Example 6. POSH reduction results in decreased secreti on of phospholioase D 
(PLD) 

Hela SS6 cells (two wells of 6-well plate) were transfected with POSH 
siRNA or control siRNA (100 nM). 24 hours later each well was split into 5 wells of 
30 a 24-well plate. The next day cells were transfected again with 1 00 nM of either 
POSH siRNA or control siRNA. The next day cells were washed three times with 
lxPBS and than 0.5 ml of PLD incubation buffer (118 mM NaCl, 6 mM KC1, 1 mM 
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CaCl 2 , 1-2 mM MgS04, 12.4 mM HEPES, pH7.5 and 1% fatty acid free bovine 
serum albumin) were added. 
- -48 hours later medium was collected and centrifuged at 800xg for 1 5 

minutes. The medium was diluted with 5xPLD reaction buffer (Amplex red PLD kit) 

5 and assayed for PLD by using the Amplex Red PLD kit (Molecular probes, A- 
12219). The assay results were quantified and presented below in as a bar graph. 
The cells were collected and lysed in 1% Triton X-100 lysis buffer (20 mM HEPES- 
NaOH, pH 7.4, 150 mM NaCl, 1.5 mM MgCl 2 , 1 mM EDTA, 1% Triton X-100 and 
lx protease inhibitors) for 15 minutes on ice. Lysates were cleared by centrifugation 

10 and protein concentration was determined. There were equal protein concentrations 
between the two transfectants. Equal amount of extracts were immunoprecipitated 
with anti-POSH antibodies, separated by SDS-PAGE and immunoblotted with anti- 
POSH antibodies to assess the reduction of POSH levels. There was approximately 
40% reduction in POSH levels (Figure 21). 

15 

Exam ple 7. Effect of hPOSH on Gag-EGFP intracellular distribution 

HeLa SS6 were transfected with Gag-EGFP, 24 hours after an initial 
transfection with either hPOSH-specific or scrambled siRNA (control) (lOOnM) or 
with plasmids encoding either wild type hPOSH or hPOSH C(12,55)A. Fixation 

20 and staining was preformed 5 hours after Gag-EGFP transfection. Cells were fixed, 
stained with Alexa fluor 647-conjugated Concanavalin A (ConA) (Molecular 
Probes), permeabilized and then stained with sheep anti-human TGN46. After the 
primary antibody incubation cells were incubated with Rhodamin-conjugated goat 
anti-sheep. Laser scanning confocal microscopy was performed on LSM510 

25 confocal microscope (Zeiss) equipped with Axiovert 100M inverted microscope 
using x40 magnification and 1.3-numerical-aperture oil-immersion lens for imaging. 
For co-localization experiments, 10 optical horizontal sections with i ntervals of 1 
um were taken through each preparation (Z-stack). A single median section of each 
preparation is shown. See Figure 22. 

30 

Example 8. POSH-Reeulated Intracellular T ransport of Mvristoylated Proteins 
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The localization of myristoylated proteins, Gag (see Figure 22), HIV-1 Nef, 
Src and Rapsyn, in cells depleted of hPOSH were analyzed by immunofluorescence. 
In control cells, HIV-1 Nef was found in a perinuclear region co-localized with 
hPOSH, indicative of a TGN localization (Figure 23). When hPOSH expression was 

5 . reduced by siKNA treatment, Nef expression was weaker relative to control and nef 
lost its TGN, perinuclear localization. Instead it accumulated in punctated 
intracellular loci segregated from the TGN. 

Src is expressed at the plasma membrane and in intracellular vesicles, which 
are found close to the plasma membrane (Figure 24, HI 87 cells). However, when 

10 hPOSH levels were reduced, Src was dispersed in the cytoplasm and loses its plasma 
membrane proximal localization detected in control (HI 87) cells (Figure 24, 
compare H153-1 and H187-2 panels). 

Rapsyn, a peripheral membrane protein expressed in skeletal muscle, plays a 
critical role in organizing the structure of the nicotinic postsynaptic membrane 

15 (Sanes and Lichtman, Armu. Rev. Neurosci. 22: 389-442 (1999)). Newly 

synthesized Rapsyn associates with the TGN and than transported to the plasma 
membrane (Marchand et al., J. Neurosci. 22: 8891-01 (2002)). In hPOSH-depleted 
cells (H153-1) Rapsyn was dispersed in the cytoplasm, while in control cells it had a 
punctuated pattern and plasma membrane localization, indicating that hPOSH 

20 influences its intracellular transport (Figure 25). 



Materials and Methods Used: 
• Antibodies: 

Src antibody was purchased from Oncogene research products( Darmstadt, 
25 Germany). Nef antibodies were pusrchased from ABI (Columbia, MA) and 
Fitzgerald Industries Interantional (Concord, MA). Alexa Fluor conjugated 
antibodies were pusrchased from Molecular Probes Inc. (Eugene, OR). 

hPOSH antibody: Glutathione S-transferase (GST) fusion plasmids were 
constructed by PCR amplification of hPOSH codons 285-430. The amplified PCR 
30 products was cloned into pGEX-6P-2 (Amersham Pharmacia Biotech, 

Buckinghamshire, UK). The truncated hPOSH protein was generated in E. coli 
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BL21 . Bacterial cultures were grown in LB media with carbenicillin (100 |J.g/ml) 
and recombinant protein production was induced with 1 mM IPTG for 4 hours at 30 
°C. Cells were lysed by sonication and the recombinant protein was then isolated 
from the cleared bacterial lysate by affinity chromatography on a glutathione- 

5 sepharose resin (Amersham Pharmacia Biotech, Buckinghamshire, UK). The 
hPOSH portion of the fusion protein was then released by incubation with 
PreScission protease (Amersham Pharmacia Biotech, Buckinghamshire, UK) 
according to the manufacturer's instructions and the GST portion was then removed 
by a second glutathione-sepharose affinity chromatography. The purified partial 

1 0 hPOSH polypeptide was used to immunize New Zealand white rabbits to generate 
antibody 15B (Washington Biotechnology, Baltimore, Maryland). 

• Construction of siRNA retroviral vectors: 

hPOSH scrambled oligonucleotide (5'- CACACACTGCCG TCAACT 
GTTCAAGAGAC AGTTGACGGCAGTGTGTGTTTTTT -3'; and 5'- 

1 5 AATTAAAAAAC AC A C ACTGCCGTC AACTGTC TCTTGAAC AGTTGA 
CGGCAGTGTGTGGGCC -3') were annealed and cloned into the Apal-EeoRI 
digested pSilencer 1.0-US (Ambion) to generate pSIL-scrambled. Subsequently, the 
U6-promoter and RNAi sequences were digested with BamHI, the ends filled in and 
the insert cloned into the Olil site in the retroviral vector, pMSVhyg (Clontech), 

20 generating pMSCVhyg-U6-scrambled. hPOSH oligonucleotide encoding RNAi 

against hPOSH (5 ' - AAC AGAGGCCTTGG AAA CCTGGAAGC TTGCAGGTTT 
CCAAGGCCTCTGTT -3'; and 5'- GATCAACAGAG GCCTTGGAAACCTGC 
AAGCTTCCAGGTTTCCAA GGCCTCTGTT -3') were annealed and cloned into 
the BamHI-EcoRI site of pLIT-U6, generating pLIT-U6 hPOSH-230. pLIT-U6 is an 

25 shRNA vector containing the human U6 promoter (amplified by PCR from human 
genomic DNA with the primers, 5'-GGCCCACTAGTCA AGGTCG GGCA 
GGAAGA- 3' and 5'- GCCGAATT CAAAAAGGATC CGGCGATATCCGG 
TGTTTCGTCCTTTCCA -3') cloned into pLITMUS38 (New England Biolabs) 
digested with Spel-EcoRI. Subsequently, the U6 promoter-bPOSH shRNA (pLIT- 

30 U6 hPOSH-230 digested with SnaBI and Pvul) was cloned into the Olil site of 
pMSVhyg (Clontech), generating pMSCVhyg U6-hPOSH-230. 
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Generation of stable clones: 



. "' J[EK 293T cells were transfected with retroviral RNAi plasmids 
(pMSCVhyg-U6-POSH-230 and pMSCVhyg-U6-scrambled and with plasmids 
encoding VS V-G and moloney gag-pol. Two days post transfection, medium 

5 containing retroviruses was collected and filtered and polybrene was added to-a final 
concentration of 8ug/ml. This was used to infect HeLa SS6 cells grown in 60 mm 
dishes. Forty-eight hours post-infection cells were selected for RNAi expression by 
the addition of hygromycin to a final concentration of 300 [ag/ml. Clones expressing 
RNAi against hPOSH were named HI 53, clones expressing scrambled RNAi were 

10 named HI 87. 

• Transfection and immunofluorescent analysis: 

Gag-EGFP experiments are described in Figure 22. 

HI 53 or HI 87 cells were transfected with Src or Rapsyn-GFP (Image clone 
image: 3530551 or pNLenv-1). Eighteen hours post transfection cells were washed 

15 withPBS and incubated on ice with A lexa Fluor 6 47 conjugated Con A to label 
plasma membrane glycoproteins. Subsequently cells were fixed in 3% 
paraformaldehyde, blocked with PBS containing 4% bovine serum albumin and 1% 
gelatin. Staining with rabbit anti-Src, rabbit anti-hPOSH (15B) or mouse anti-nef 
was followed with secondary antibodies as indicated. 

20 Laser scanning confocal microscopy was performed on LSM510 confocal 

microscope (Zeiss) equipped with Axiovert 100M inverted microscope using x40 
magnification and 1.3-numerical-aperture oil-immersion lens for imaging. For co- 
localization experiments, 10 optical horizontal sections with intervals of 1 um were 
taken through each preparation (Z-stack). A single median section of each 

25 preparation is shown. 

Example 9. POSH Reduction bv siRNA Ab rogates West Nile Virus ("WNV") 
Infectivitv. 

HeLa SS6 cells were transfected with either control or POSH-specific 
30 siRNA. Cells were subsequently infected with WNV (4x 1 0 4 PFU/well). Viruses 
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were harvested 24 hours and 48 hours post-infection, serially diluted, and used to 
infect Vero cells. As a control WNV (4xl0 4 PFU/well), that was not passed through 
HeLa SS6 cells, was used to infect Vero cells. Virus titer was determined by plaque 
assay in Vero cells. 

5 Virus titer was reduced by 2.5-log in cells treated with POSH-specific 

siRNA relative to cells transfected with control siRNA, thereby indicating that 
WNV requires POSH for virus secretion. See Figure 26. 

Experimental Procedure: 

10 • Cell culture, transfections and infection: 

Hela SS6 cells were grown in Dulbecco's modified Eagle's medium 
(DMEM) supplemented with 10% heat-inactivated fetal calf serum and 100 units/ml 
penicillin and 100 ug/ml streptomycin. For transfections, HeLa SS6 cells were 
grown to 50% confluency in DMEM containing 10% FCS without antibiotics. Cells 

1 5 were then transfected with the relevant double-stranded siRNA (1 00 nM) using 
lipofectamin 2000 (Invitrogen, Paisley, UK). On the day following the initial 
transfection, cells were split 1:3 in complete medium and transfected with a second 
portion of double-stranded siRNA (50 nM). Six hours post-transfection medium was 
replaced and cells infected with WNV (4x1 0 4 PFU/well). Medium was collected 

20 from infected HeLa SS6 cells twenty-four and forty-eight post-infection (200 

serially diluted, and used to infect Vero cells. Virus titer was determined by plaque 
assay (Ben-Nathan D, Lachmi B, Lustig S, Feuerstien G (1991) Protection of 
dehydroepiandrosterone (DHEA) in mice ifected with viral encephalitis. Arch Viro; 
120, 263-271). 

25 

Example 10. Analysis of the effects of POSH knockdown on M-MuLV expression 
and budding 
Experimental Protocol: 
Transfections:- 

30 A day before transfection, Hela SS6 cells were plated in two 6 wells plates at 

5 xlO 5 cells per well. 24 hours later the following transfections were performed: 
4 wells were transfected with control siRNA and a plasmid encoding MMuLV. 
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4 wells were transfected with POSH siRNA and a plasmid encoding MMuLV. 
1 well was a control without any siRNA or DNA transfected. 
1 well was transfected with a plasmid encoding MMuLV. 

For each well to be transfected 100 nM (12.5 ul) POSH siRNA or 100 nM 
(12.5 ul) control siRNA were diluted in 250 ul Opti-MEM (Invitrogen). 
Lipofectamin 2000 (5 ul) (Invitrogen, Cat. 1 1668-019) was mixed with 250 ul of 
OptiMEM per transfected well. The diluted siRNA was mixed with the lipofectamin 
2000 mix and the solution incubated at room temperature for 30 min. The mixture 
was added directly to each well containing 2 ml DMEM +10% FBS (w/o 
antibiotics). 

24 hours later, four wells of the same siRNA treatment were split to eight 
wells, and two wells without siRNA were split to four wells. 

24 hours later all wells were transfected with 100 nM control siRNA or 100 
nM POSH siRNA with or without a plasmid encoding MMuLV (see table below). 
48 hours later virions and cells were harvested. 



No of 
wells 


RNAi 


Amount 
ofRNAi 
(ul) per 
well 


Amount of DNA 
(fig) per well 


The volume of 
DNA (ul) per 
well 


Application 


5 


POSH 
100nM(l sl 
and 2 nd 
transfection) 


12.5 


MMuLV (2 ug) 


10 


4 wells for 
VLPs assay 
and 1 well for 
RT 


5 


Control 
100 nM (l sl 
and 2 nd 
transfection) 


12.5 


MMuLV (2 ug) 


10 


4 wells for 
VLPs assay 
and 1 well for 
RT 


1 








10 ul H 2 0 


VLPs assay 


1 






MMuLV (2 ug) 


10 


VLPs assay 



Steady state VLP assay 
Cell extracts:- 

1 . Pellet floating cells by centrifugation (10 min, 500xg at 4 °C), save 
supernatant (continued at step 7), wash cells once, scrape cells in ice-cold 
lxPBS, add to the corresponding cell pellet and centrifuge for 5 min 1800 
rpm at 4 °C. 

2. Wash cell pellet once with ice-cold lxPBS. 
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... 3. Resuspend cell pellet in 150 nl 1% Triton X-100 lysis buffer (20 mM 

HEPES-NaOH, pH 7.4, 150 mM NaCl, 1.5 mM MgCl 2 , 1 mM EDTA, 1% 
L ^Triton X-100 and lx protease inhibitors) and incubate 20 minutes on ice. 

4. Centrifuge at 14,000rpm for 15 min. Transfer supernatant to a clean tube. 
5 5. Determine protein concentration by BCA. 

6. Prepare samples for SDS-PAGE by adding 2 p.1 of 6xSB to 20 fig exfract 
(add lysis buffer to a final volume of 12 fil), heat to 80 °C for 10 min. 

Purification of virions from cell media 
10 7. Filtrate the supernatant through a 0.45 pm filter. 

8. Transfer 1500 jxl of virions fraction to an ultracentrifuge tube (swinging 
rotor). 

9. Add 300 ill of fresh sucrose cushion (20% sucrose in TNE) to the bottom of 
the tube. 

15 10. Centrifuge supernatant at 35000 rpm at 4 °C for 2 hr. 

11. Resuspend virion pellet in 50 \il hot lx sample buffer each (samples 153-1, 
2, 3, 187-1, 2, 3). Resuspend VLPs pellet (153-4, 5 and 187 4, 5) in 25 *il hot 
lx sample buffer. Vortex shortly, transfer to an eppendorf tube, unite VLPs 
from wells 153-4+5 and 187- 4+5. Heat to 80 °C for 10 min. 
20 12. Load equal amounts of VLPs relatively to cells extracts amounts. 

Western Blot analysis 

1 . Separate all samples on 12% SDS-PAGE. 

2. Transfer samples to nitrocellulose membrane (1 00 V for 1 . 1 5 hr). 

3. Dye membrane with ponceau solution. 

25 4. Block with 10% low fat milk in TBS-T for 1 hour. 

5. Incubate membranes with Goat anti p30 (8 1 S-263) (1 :5000) in 1 0% low fat 
milk in TBS-T over night at 4 °C. Incubate with secondary antibody rabbit 
anti goat-HRP 1:8000 for 60 min at room temperature. 

6. Detect signal by ECL reaction. 

30 7. Following the ECL detection incubate memebranes with Donkey anti rabbit 

Cy3 (Jackson Laboratories, Cat 71 1-165-152) 1:500 and detect signal by 
Typhoon scanning and quantitate. 
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Results: 

As shown in Figure 27, POSH knockdown decreases the release of 
extracellular MMuLV particles. 

5 

Example 1 1. POSH Protein-protein interactions bv veast two hybrid assay 

POSH-associated proteins were identified by using a yeast two-hybrid assay. 
Procedure: 

Bait plasmid (GAL4-BD) was transformed into yeast strain AH109 
1 0 (Clontech) and transformants were selected on defined media lacking tryptophan. 
Yeast strain Y187 containing pre-transformed Hela cDNA prey (GAL4-AD) library 
(Clontech) was mated according to the Clontech protocol with bait containing yeast 
and plated on defined media lacking tryptophan, leucine, histidine and containing 2 
mM 3 amino triazoL Colonies that grew on the selective media were tested for beta- 
1 5 galactosidase activity and positive clones were further characterized. Prey clones 
were identified by amplifying cDNA insert and sequencing using vector derived 
primers. 
Bait: 

Plasmid vector: pGBK-T7 (Clontech) 
20 Plasmid name: pPL269- pGBK-T7 GAL4 POSHdR 

Protein sequence: Corresponds to aa 53-888 of POSH (RING domain deleted) 

RTLVGSGVEELPSNILLVRLIiDGIKQRPWKPGPGGGSGTNCTNALRSQSSTVANCSSKDL 
QSSQGGQQPRVQSWS PPVRGI PQLPCAKALYNYEGKEPGDLKFSKGDI 1 1 LRRQVDENWY 
HGEWGIHGFFPTNFVQIIKPLPQPPPQCKALYDFEVKDKEADKDCLPFAKDDVIjTVIRR 
25 VDENWAEGMLADKIGI FPI S YVEFNSAAKQL I EWDKP PVPGVDAGECSSAAAQSSTAPKH 
SDTKKNTKKRHSFTSLTMANK5SQASQNRHSMEISPPVLISSSNPTAAARISELSGLSCS 

APSQVHISTTGLIVTPPPSSPWTGPSFTFPSDVPYQ 

TPPGATAAAAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPYTPRKEDELELRKGEMF 
LVFERCQDGWFKGTSMHTSKIGVFPGNYVAPVTRAVTNASQAKVPMSTAGQTSRGVTMVS 
30 PSTAGGPAQKLQGNGVAGS PS WPAAWS AAH I QTS PQAKVLLHMTGQMTVNQARNAVRT 
VAAHNQERPTAAVTP I QVQNAAGLS PASVGLSHHSLiAS PQPAPLMPGSATHTAAI S I SRA 
SAPIiACAAAAPIiTSPSITSASLEAEPSGRIVTVLPGLPTSPDSASSACGNSSATKPDKDS 
KKEKKGLLKLIiSGASTKRKPRVSPPASPTLEVELGSAELPLQGAVGPELPPGGGHGRAGS 
CPVDGDGPVTTAVAGAALAQDAFHRKASSLDSAVPIAPPPRQACSSLGPVLNESRPWCE 

35 RHRVWS YPPQSEAELELKEGD I VFVHKKREDGWFKGTLQRNGKTGIjFPGS FVENI 
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Library screened: Hela pretransformed library (Clontech). 

POSH-APs identified by yeast two-hybrid assay are provided in Tables 7 and 
8. Also, the nucleic acid and amino acid sequences of POSH-APs identified by 
yeast two-hybrid assay are provided in Figure 36. In addition, the nucleic acid and 
5 amino acid sequences of ARF1 and AKF5 are provided in Figure 36. 

F.xam ple 12. Inhibition of PKA Kinase Activit y Attenuates HIV-l Virus Maturation 
HeLa SS6 cells were transfected with pNLenv-l PT AP or pNLenv- 1 ataa (L- 
domain mutant). Eighteen hours post-transfection, cells were transferred to 20 °C for 

10 two hours in order to inhibit transport of viral particles from the trans-Go\& (TGN) 
to the plasma membrane (PM). Subsequently, the PKA inhibitor, H89 (50 uM) 
(Biosource, Cat. No. PHZ1114) or DMSO were added to the cells and dishes were 
transferred to 37 °C to initiate transport from the TGN to the PM. Reverse 
transcriptase activity was assayed from virus-like-particles collected from cell 

15 supernatant twenty minutes later. H89 treatment resulted in complete inhibition of 
RT activity. Thus, demonstrating that PKA activity is required for HTV-1 viral 
maturation. 

Materials and methods: 
20 Cell culture and transfections 

Hela SS6 cells were grown in Dulbecco's modified Eagle's medium 
(DMEM) supplemented with 10% heat-inactivated fetal calf serum and 100 units/ml 
penicillin and 100 ug/ml streptomycin. For transfections, HeLa SS6 cells were 
grown to 100% confluency in DMEM containing 10% FCS without antibiotics. 
25 Cells were then transfected with HIV-l N Lenvi (2 \xg per 6-well) (Schubert et al., 
1995). 

Assays for virus release by RT activity 

Virus and virus-like particle (VLP) release by reverse transcriptase activity 

was determined one day after transfection with the pro-viral DNA as previously 

30 described ( Adachi et al., 1 986; Fukumori et al., 2000; Lenardo et al., 2002). The 

culture medium of virus-expressing cells was collected and centrifuged at 500 x g 
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for 10 minutes. The resulting supernatant was passed through a 0.45 |am-pore filter 
and the filtrate was centrifuged at 14,000 x g for 2 hours at 4 °C. The resulting 

^ supernatant was removed and the viral-pellet was re-suspended in cell solubilization 
buffer (50 raM Tris-HCl, pH7.8, 80 mM potassium chloride, 0.75 raM EDTA and 
5 0.5% Triton X-100, 2.5 mM DTT and protease inhibitors). The corresponding cells 
were washed three times with phosphate-buffered saline (PBS) and then solubilized 
by incubation on ice for 15 minutes in cell solubilization buffer. The cell detergent 
extract was then centrifuged for 15 minutes at 14,000 x g at 4 °C. The sample of the 
cleared extract (normally 1:10 of the initial sample) were resolved on a 12.5% SDS- 

10 polyacrylamide gel, then transferred onto nitrocellulose paper and subjected to 
immunoblot analysis with rabbit anti-CA antibodies. The CA was detected after 
incubation with a secondary anti-rabbit antibody conjugated to Cy5 (Jackson 
Laboratories, West Grove, Pennsylvania) and detected by fluorescence imaging 
(Typhoon instrument, Molecular D ynamics, Sunnyvale, California). The Pr55 and 

15 CA were then quantified by densitometry. A colorimetric reverse transcriptase assay 
(Roche Diagnostics GmbH, Mannenheim, Germany) was used to measure reverse 
transcriptase activity in VLP extracts. RT activity was normalized to amount of Pr55 
and CA produced in the cells. 

20 Example 13. hPOSH is phosphorvlated bv Prote in kinase A (PKA) 

PKA is a cAMP-dependent kinase. The holoenzyme is a tetramer of two 
catalytic subunits (cPKA) bound to two regulatory subunits PRKR1 or PRKR2. 
Activation proceeds by the cooperative binding of two cAMP molecules to each R 
subunit, which causes the dissociation of each active C subunit from the R subunit 

25 dimer. The consensus sequence for phosphorylation by the C subunit is, stringently, 
K/R-R-X-S/TY and less stringently, R-X-X-S/TY, where Y tends to be a 
hydrophobic residue. The intracellular localization of PKA is controlled thorough 
association w ith A -kinase-anchoring p roteins ( AKAPs). T he r egulatory s ubunit o f 
protein kinase A (PRKR1A) was identified as a POSH interactor by yeast-two- 

30 hybrid screen, thereby implicating POSH as an AKAP. 

Protein kinase A was demonstrated to be required for the budding of 
transport vesicles from the TGN (Muniz et al., 1997, Proc Natl Acad Sci USA, 
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94:14461-6). Furthermore, it was demonstrated that an inhibitor of PKA, H89, is 
able to block HIV-1 release from cells (Cartier et al., 2003, J Biol Chem., 
278:35211-9). Since POSH is localized at the TGN and is implicated as an AKAP, 
POSH may regulate PKA-mediated budding at the TGN of vesicles and HTV-1 . 

5 Applicants demonstrated that POSH is phosphorylated by PKA. Several 

putative PKA phosphorylation sites are found within hPOSH coding sequence 
(Figure 30). Phosphorylation of gravin, an AKAP, by PKA modulates its binding to 
the b2-adrenergic receptor. This serves to regulate the mobilization of gravin and 
PKA to the cell membrane and regulation of b2-AR activity by PKA. Two putative 

10 PKA sites are located in the putative-rac-binding region in POSH. Toward this end, 
POSH was subjected to in-vitro phosphorylation and binding to the small GTPase 
Racl (Figure 31). Indeed, only unphosphorylated POSH was able to bind activated, 
GTP-loaded, Racl, demonstrating that phosphorylation regulates the binding of 
POSH to small GTPases, such as Racl. GTPases of this sort family include TCL, 

15 TC10, Cdc42, Wrch-1, Rac2, Rac3 or RhoG (Aspenstrom et al., 2003, Biochem J., 
377(Pt 2):327-37). Small GTPases of this sort are involved in protein trafficking in 
the secretory system, including the trafficking of viral proteins, such as those of 
HIV. 

Materials and methods 

20 PKA-dependent phosphorylation of hPOSH. 

Bacterially expressed recombinant maltose-binding-protein (MBP)-hPOSH 
(3 ug) or GST-c-Cbl were incubated at 30oC for 30 minutes with (*) or without 10 
ng PKA catalytic subunit (PKAc) in a buffer containing 40 mM Tris-HCl pH 7.4, 10 
mM MgCl 2 , 4 mM ATP, 0.1 mg/ml BSA, 1 uM cAMP, 23 mM K3PO4, 7 nM DTT, 

25 and PKA peptide protection solution (Promega, Cat.No. V5340). The reaction was 
stopped by the addition of SDS-sample buffer, and boiling for 3 minutes. Samples 
were separated by SDS-PAGE on a 10% gel, and transferred to nitrocellulose and 
immunoblotted as detailed in the figure. 

Binding of Racl to hPOSH 
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Bacterially expressed hPOSH (1 ug) or GST (1 ug) were phosphorylated as 
above. The reaction was terminated by the addition 0.5 ml of ice-cold 200 mM Tris- 
HC1 pH 7.4, 5 mM EDTA. hPOSH and GST were then immobilized on NiNTA or 
reduced glutathione beads, respectively, by gentle mixing for 30 minutes. The 

5 immobilized proteins were washed three times with wash buffer (50 mM Tris-HCl 
pH 7.4, 100 mM NaCl, 5 mM MgC12, 0.1 mM DTT). Recombinant Rac-1 (0.2 fig) 
(Sigma catalog # R3012) was incubated with or without 0.3 mM GTPyS (Sigma Cat. 
No. G8638) on ice for 15 minutes. The GTP/mock-loaded Rac-1 was then added to 
wash buffer (25 ul, final) and incubated for 30 minutes at 30 °C. The beads were 

1 0 then washed three times with wash buffer containing 0.1% Tween 20. Sample buffer 
was added to the bead pellet and boiled for 3 minutes. Immobilized and associating 
proteins were then separated by SDS-PAGE on a 12% gel and immunobloted with 
anti-Rac-1 (Santa Cruz Biotechnology, Cat. No. sc-217). Input is 0.25 ug of Rac-1 . 



15 Exam ple 14. HERPUD1 Depletion bv siRNA Reduces HIV Maturation. 

Hela SS6 cells were transfeted with siRNA directed against HERPUD1 and 
with aplsmid encoding HIV proviral genome (pNLenv-1). Twenty four hours post- 
HIV transfection, virus-like particles (VLP) secreted into the medium were isolated 
and reverse transcriptase activity was determined. HIV release of active RT is an 

20 indication for a release of processed and mature virus. When the levels of 

HERPUD1 were reduced RT activity was inhibited by 80%, demonstrating the 
importance of HERPUD1 in HTV-maturation. See Figure 33. 
Experimental Outline 

• Cell culture and transfection: 

25 HeLa SS6 were kindly provided by Dr. Thomas Tuschl (the laboratory of 

RNA Molecular Biology, Rockefeller University, New York, New York). Cells 
were grown in Dulbecco's modified Eagle's medium (DMEM) supplemented with 
10% heat-inactivated fetal calf serum and 100 U/ml penicillin and 100 ug/ml 
streptomycin. For transfections, HeLa SS6 cells were grown to 50% confluency in 

30 DMEM containing 1 0% FCS without antibiotics. Cells were then transfected with 
the relevant double-stranded siRNA (50-100nM) (HERPUD1: 5'- 
GGGAAGUUCUUCGG AACCUdTdT-3 ' and 5 
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dTdTCCCUUC AAGAAGCCUUGGA-5 ') using lipofectamin 2000 (Invitrogen, 
Paisley, UK). A day following the initial transfection cells were split 1 :3 in complete 
^ - medium and co-transfected 24 hours later with HIV-lNLenvl (2 ug per 6-well) 
(Schubert et al., J. Virol. 72:2280-88 (1998)) and a second portion of double- 

5 stranded siRNA. 

• Assay for virus release 
Virus and virus-like particle (VLP) release was determined one day after 
transfection with the proviral DNA as previously described (Adachi et al., J. Virol. 
59: 284-91 (1986); Fukumori et al., Vpr. Microbes Infect. 2: 1011-17 (2000); 

10 Lenardo et al., J. Virol. 76: 5082-93 (2002)). The culture medium of virus- 
expressing cells was collected and centrifuged at 500 x g for 10 minutes. The 
resulting supernatant was passed through a 0.45um-pore filter and the filtrate was . 
centrifuged at 1 4,000 x g for 2 hours at 4°C. The resulting supernatant was removed 
and the viral-pellet was re-suspended in SDS-PAGE sample buffer. The 

1 5 corresponding cells were washed three times with phosphate-buffered saline (PBS) 
and then solubilized by incubation on ice for 15 minutes in lysis buffer containing 
the following components: 50 mM HEPES-NaOH, (pH 7.5), 1 50 mM NaCl, 1.5 mM 
MgCl 2 , 0.5% NP-40, 0.5% sodium deoxycholate, 1 mM EDTA, 1 mM EGTA and 
1 :200 dilution of protease inhibitor cocktail (Calbiochem, La Jolla, California). The 

20 cell detergent extract was then centrifuged for 15 minutes at 14,000 x g at 4°C. The 
VLP sample and a sample of the cleared extract (normally 1 :10 of the initial 
sample) were resolved on a 12.5% SDS-polyacrylamide gel, then transferred onto 
nitrocellulose paper and subjected to immunoblot analysis with rabbit anti-CA 
antibodies. The CA was detected either after incubation with a secondary anti-rabbit 

25 horseradish peroxidase-conjugated antibody and detected by Enhanced Chemi- 

Luminescence (ECL) (Amersham Pharmacia) or after incubation with a secondary 
anti-rabbit antibody conjugated to Cy5 (Jackson Laboratories, West Grove, 
Pennsylvania) and detected by fluorescence imaging (Typhoon instrument, 
Molecular Dynamics, Sunnyvale, CA). The Pr55 and CA were then quantified by 

30 densitometry and the amount of released VLP was then determined by calculating 
the ratio between VLP-associated CA and intracellular CA and Pr55 as previously 
described (Schubert et al., J. Virol. 72:2280-88 (1998)). 
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• Analysis of reverse transcriptase activity in supernatants 
RT activity was determined in pelleted VLP (see above) by using an RT 
assay kit (Roche, Germany; Cat.No. 1468120). Briefly, VLP pellets were 
resuspended in 40 pi RT assay lysis buffer and incubated at room temperature for 30 

5 minutes. At the end of incubation 20 pi RT assay reaction mix was added to each 
sample and incubation continued at 37°C overnight. Samples (60 pi) were than 
transferred to MTP strip wells and incubated at 37°C for 1 hour. Wells were washed 
five times with wash buffer and DIG-POD added for a one-hour incubation at 37°C. 
At the end of incubation wells were washed five times with wash buffer and ABST 

10 substrate solution was added and incubated until color developed. The absorbance 
was read in an ELISA reader at 405 nm (reference wavelength 492 ran). The 
resulting signal intensity is directly proportional to RT activity; RT concentration 
was determined by plotting against a known amount of RT enzyme included in 
separate wells of the reaction. 

15 

Example 15. MSTP028 Reduction bv siRNA Decreases HIV VLP Production. 

This example demonstrates the effects of an siRNA-mediated decrease in 
MSTP028 expression on the production of HP/ virus-like particles in HeLa cells. 
The effects were measured at steady state. 
20 Experiments were performed according to two different protocols. 

Experiment 1 proceeded with a second transfection on day 3, while Experiment 2 
involved an additional exchange of medium on day 3, and proceeded to the second 
transfection on day 4. The results from Experiment 1 are shown Figure 29 A, and 
those for Experiment 2 are shown in Figure 29B. 



25 



30 



Day 1 : Preparing Cells 

4.5X105 HeLa SS6 cells/well, were seeded in 1 x 6 well plates. Cells were seeded 
in transfection medium (growing medium free of antibiotics). 

Materials: 

Cat. No. Manufacture Reagent Name 
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D5796 
04-121-1A 
D8537 
P4333 
5 T4049 



Sigma 

Beit Haemek 
Sigma 
Sigma 
Sigma 



DMEM 

FCS 

PBS 

Pen/Strep 

0.25% Trypsin-EDTA 



Day 2: Transfection 
Materials: 

10 Cat. No. Manufacture Reagent Name 
11668-027 Invitrogen LF2000 reagent 
31985-047 GibcoBRL OptiMEM 

# 

MSTP028 RNAi constructs: 
15 siRNA target sequence Accession Pos 

MST028 AAGTGCTCACCGACAGTGAAG NM_031954 197 
MST028 AAGATACTTATGAGCCTTTCT NM_031954 392 



Experimental and Control Conditions: 
20 1 - Control siRNA+ pNLEnv- 1 

2- POSH siRNA + pNLenv-1 

3- MSTP028 siRNA + pNLenv-1 



1 . Two hours before transfection, replace cell media to 2ml/well complete 
25 DMEM without antibiotics. 

2. siRNA dilution: for each transfection dilute 100 nm siRNA in 0.25 ml 
OptiMEM per well. 

3. LF 2000 dilution: for each well dilute 5ul lipofectamine reagent in 0.25ml 
OptiMEM. 

30 4. Incubate diluted siRNAs and LF 2000 for 5 minutes at RT. 

5. Mix the diluted siRNAs with diluted LF2000 and incubated for 25 minutes at 

RT. 
9372369J 
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6. Add the mixture to the cells, 0.5 ml/well (drop wise) and incubate for 24 
hours at 37°C in CO2 incubator. 

Transfections: for each well 
5 (12.5 ii\ (siRNA)/ 0.25 ml OptiMEM) x 3 
LF 2000 35 |il /1.75 ml 

Day 3: 

10 Exp. 1 : second transfection (as Day 4 below). 
Exp. 2: Exchange medium. 

Day 4: 

15 Exp. 1 : VLP assay (see below). 
Exp. 2: Second transfection 

1 . Two hours before transfection, replace cell media to 2ml/well complete 

DMEM without antibiotics. 
20 2. siRNA and DNA dilution: Prepare dilution of plasmid pNLenv-1 0.75 / 

well in 0.25 ml OptiMEM (total of 3 wells). Divide plasmid dilution to 

eppendorf tubes (0.25 ml each). To each tube add siRNA 40nM (2.5 |il). 
3. LF 2000 dilution: for each well dilute 5|il lipofectamine reagent in 0.25ml 

OptiMEM. 

25 4. Incubate diluted siRNAs and LF 2000 for 5 minutes at RT. 

5. Mix the diluted siRNAs with diluted LF2000 and incubated for 1 hour at RT. 

6. Add the mixture to the cells, 0.5 ml/well (drop wise) and incubate for 24 
hours at 37°C in C0 2 incubator. 

30 Day 5: 



Exp. 2: VLP assay. 
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Solutions: 
Lysis buffer 
Tris-HCl pH 7.6 
MgC12 
NaCl 
Glycerol 
NP-40 
DOC 
EDTA 
EGTA 

AddPI 3 C 1:200. 



50mM 



10% 
0.5% 
0.5% 



1.5mM 
150mM 



ImM 
ImM 



15 



20 



Steady state VLP assay 
A. Cell extracts 



1. 



2. 
3. 



25 5. 



Pellet floating cells by centrifugation (lmin, 14000rpm at 40C), save 
supernatant (continue with supernatant immediately to step B), scrape cells 
in'ice-cold PBS, add to the corresponding floated cell pellet and centrifuge 
for 5min 1 800rpm at 40C. 
Wash cell pellet once with ice-cold PBS. 

Resuspend cell pellet (from 6 well) in 100 \il NP40-DOC lysis buffer and 
incubate 10 minutes on ice. 

Centrifuge at 14,000rpm for 15min. Transfer supernatant to a clean 
eppendorf. 

Prepare samples for SDS-PAGE by adding them sample buffer and boil for 
lOmin - take the same volume for each reaction (15 ul). 



30 



B. Purification of VLP from cell media 

1. Filtrate the supernatant through a 0.45um filter. 

2. Centrifuge supernatant at 1 4,000rpm at 40C for at least 2h. 

3. Resuspend VLP pellet in 50 ul IX sample buffer and boil for 10 min. Load 
25 pi of each sample. 
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C. Western Blot analysis 
, - 1 . -Run all samples from stages A and B on Tris-Gly SDS-PAGE 1 2.5%. 



4. Block with 1 0% low fat milk in TBS-t for lh. 

5. Incubate with anti p24 rabbit 1 :500 in TBS-t 2 hour (room temperature) - 
overnight (40C). 

6. Wash 3 times with TBS-t for 7min each wash. 

1 0 7. Incubate with secondary antibody anti rabbit cy5 1 :500 for 30min. 

8. Wash five times for lOmin in TBS-t 

9. View in Typhoon for fluorescence signal (650). 

# 

Example 16. POSH-depleted cells have lower levels o f Hero and it is not 
15 monoubiquitinated 

POSH-depleted cells and their control counterparts were lysed and 
immunoblotted with anti-herp antibodies. Cells depleted of POSH (HI 53 RNAi 
stables cell lines) cells have lower levels of Herp compared with control cells (HI 87 
RNAi) (Figure 34A panel A). When cells were trasnfected with a plasmid encoding 

20 flagged-tagged ubiquitin, and immunoprecipitated with anti-flag antibodies to 

immunoprecipitate ubiquitinated proteins, Herp was ubiquitinated only in HI 87 cells 
and not in HI 53 cells (Figure 34A panel B). When the aforementioned cells were 
transfected with Herp-encoding plasmid, exogenous herp levels were also reduced in 
HI 53 cells compared to HI 87 cells (Figure 34B panel A) and the ubiquitination of 

25 exogenous herp was reduced in the former cells, similar to endogenous Herp. The 
molecular weight of ubiquitinated Herp is as predicated to full-length Herp and does 
not seem as a high molecular weight smear, a characteristic of polyubiquitinated 
proteins. Thus POSH is responsible for the mono-ubiquitination of Herp, and in the 
absence of this modification herp is subjected to degradation, which may be 

30 mediated by the proteosome. 



5 



2. 



3. 



Transfer samples to nitrocellulose membrane (100V for 1 .15h.). 
Dye membrane with ponceau solution. 



Materials and methods 
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Plasmid generation 

Full-length Herp was cloned from image clone MGC:45 131 IMAGE:5575914 
(GeneBank Accesion BC032673) into pCMV-SPORT6. 

5 

Antibody production 

Herp 1 (amino a cids 1 to251)wasa mplified f rom a p lasmid ( 3Gd4) o btained b y 
yeast two hybrid screen for interactors of POSH. The amplified open reading frame 
was cloned into pGEX-6P, expressed in E. coli BL21 by induction with 1 mM IPTG 
10 and purified on glutathione-agarose. The purified protein was cleaved with 
Precision™ protease (Amersham Biosciences) and the GST moiety removed by 
glutathione chromatography. The protein was injected into rabbits (Washington 
Biotechnology) to produce anti-Herpl sera. 

15 Transfections and antibody detection 

Twenty-four hours prior to transfection POSH-RNAi clones (H153) or control- 
RNAi clones (HI 87) cells were plated in 10 cm dishes in growth medium (DMEM 
containing 10% fetal calf serum without antibiotics). Cells were transfected with 
lipofectamin 2000 (frivitrogen Corporation) and either Herp-expression plasmid (2.5 

20 ug) or empty vector (2.5 ug) and a vector encoding Flag-tagged ubiquitin (1 ug). 
Twenty-four hours post-trasnfection cells were lysed in lysis buffer (50 mM Tris- 
HC1, pH7.6, 1.5 mM MgC12, 150 mM NaCL, 10% glycerol, 1 mM EDTA, 1 mM 
EGTA, 0.5% NP-40 and 0.5% sodium deoxycholate, containing protease inhibitors) 
and subjected to immunoprecipitation with anti-Flag antibodies (Sigma, F7425) to 

25 precipitate ubiquitinated proteins, hnmunoprecipitated material and total cell lysates 
were separated on 10% SDS-PAGE and transferred to nitrocellulose membranes 
which were immunoblotted with anti-Herp antibodies. 

Generation of H187 and H153 cell lines 
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To relieve the necessity for multiple transfections and to improve the reproducibility 
of hPOSH reduction, we have generated two cell lines, H187 and H153 
constitutively expressing an integrated control and hPOSH siRNA (respectively). 

Construction of shRNA retroviral vectors- hPOSH scrambled oligonucleotide (5'- 
5 CACACACTGCCGTCAACTGTTCAAGAGACAGTTGACGGCAGTGTGTGTTT 
TTT-3'; and 5 ' - AATTAAAAAAC AC AC ACTGCCGTC AACTGTCTCTTG AAC A 
GTTGACGGCAGTGTGTGGGCC- 3') were annealed and cloned into the Apal- 
EcoRI digested pSilencer 1.0-U6 (Ambion, Inc.) to generate pSIL-scrambled. 
Subsequently, the U6-promoter and RNAi sequences were digested with BamHI, 
1 0 and blunted by end filling. The insert was cloned into the Olil site in the retroviral 
vector, pMSCVhyg (BD Biosciences Clontech), generating pMSCVhyg-U6- 
scrambled. The hPOSH oligonucleotide encoding RNAi against hPOSH 
(5'-AACAGAGGCCTTGGAAACCTGGAAGCTTGCAGGTTTCCAAGGCCTCT 

GTT-3'; and 

15 5'-GATCAACAGAGGCCTTGGAAACCTGCAAGCTTCCAGGTTTCCAAGGC 
CTCTGTT-3') were annealed and cloned into the BamHI-EcoRV site of pLIT-U6, 
generating pLIT-U6 hPOSH-230. The pLIT-U6 is an shRNA vector containing the 
human U6 promoter (amplified by PCR from human genomic DNA with the 
primers, 5 ' -GGCCC ACTAGTC AAGGTCGGGC AGGAAG A-3 ' and 

20 5'-GCCGAATTCAAAAAGGATCCGGCGATATCCGGTGTTTCGTCCTTTCCA- 
3') cloned into pLITMUS38 (New England Biolabs, Inc.) digested with Spel-EcoRI. 
Subsequently, the U6 promoter-hPOSH shRNA (pLIT-U6 hPOSH-230 digested 
with SnaBI and Pvul) was cloned into the Olil site of pMSCVhyg (BD Biosciences 
Clontech) generating pMSCVhyg U6-hPOSH-230. 
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Recombinant retrovirus production- HEK 293T cells were transfected with 
~ retroviral RNAi plasmids (pMSCVhyg-U6-POSH-230 and pMSCVhyg-U6- 
^ scrambled and with plasmids encoding VSV-G and Moloney Gag-pol. Two days 
post-transfection, the retrovirus-containing medium was collected and filtered. 

5 Infection and selection- Polybrene (Hexadimethrine bromide) (Sigma) (8ug/ml) 
was added to the filtered and the treated medium was subsequently used to infect 
HeLa SS6 cells. Forty-eight hours post-infection clones were selected for RNAi 
expression by the addition of hygromycin (300 ug/ml). Clones expressing the 
scrambled and the hPOSH RNAi were termed HI 87 and H153 (respectively). 

10 Example 17. Inhibition of HBV production 

HepG2.2.15 cells were plated on 9cm dishes and allowed to grow in 8% FCS 
for 5 days up to 70% confluence. After 5 days, cells were washed twice with PBS 
and re-supplied with fresh DMEM without FCS. In this medium, cells were treated 
every 24 hours with the depicted solutions (3ul solution/lml medium) for another 4 

15 days (4 treatments total). After 4 days, medium was collected from each plate, 
viruses were sedimented and analyzed. 

As shown in Figure 35, lanes 7 and 8, compounds CAS number 14567-55-4 
and CAS number 414908-38-0 inhibit HBV production at a concentration of 3uM. 
Detection of HBV proteins was performed essentially as described in Paran, N et al 

20 (2001)EMBOJ20(16):4443-4453. 



INCORPORATION BY REFERENCE 

All publications and patents mentioned herein are hereby incorporated by 
reference in their entirety as if each individual publication or patent was specifically 
25 and individually indicated to be incorporated by reference. In case of conflict, the 
present application, including any definitions herein, will control. 
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EQUIVALENTS 

While specific embodiments of the subject applications have been discussed, 
the above specification is illustrative and not restrictive. Many variations of the 
applications will become apparent to those skilled in the art upon review of this 
specification and the claims below. T he full scope of the applications shouldbe 
determined by reference to the claims, along with their full scope of equivalents, and 
the specification, along with such variations. 



10 
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What Is Claimed: 

1. -- An isolated, purified or recombinant complex comprising a POSH 

polypeptide and a POSH-associated protein (POSH-AP). 

2. The complex of claim 1 , wherein the POSH-AP comprises a polypeptide 
5 selected from the group consisting of: PKA, SNX1 , SNX3, ATP6V0C, 

PTPN12, PPP1CA, GOSR2, CENTB1, DDEF1, ARF1, ARF5, PACS-1, 
EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, EIF3S3, SRA1 , CBL- 
B, RALA, SIAH1, SMN1, SMN2, SYNE1, TTC3, VCY2IP1 and UBE2N 
(UBC13). 

10 3. The complex of claim 1, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: ARHV (Chp), WASF1, H1P55, 
SPG20, HLA-A, and HLA-B. 

4. The complex of any one of claims 1 -3, wherein the POSH polypeptide is a 
human POSH polypeptide. 

15 5. An isolated, purified or recombinant complex comprising HERPUD 1 and a 
Ubiquitin ligase. 

6. The complex of claim 5, wherein the Ubiquitin ligase is selected from the 
group consisting of: POSH, CBL-B, TTC3, and SIAH1. 

7. A method for identifying an agent that modulates an activity of a POSH 
20 polypeptide or POSH-AP, the method comprising identifying an agent that 

disrupts a complex of any one of claims 1-3, wherein an agent that disrupts a 
complex of any of claims 1-3 is an agent that modulates an activity of the 
POSH polypeptide or the POSH-AP. 

8. A method of identifying an antiviral agent, comprising: 

25 (a) identifying a test agent that disrupts a complex comprising a POSH 

polypeptide and a POSH-AP; and 
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(b) evaluating the effect of the test agent on a function of a virus, 

wherein an agent that inhibits a pro-infective or pro-replicative function of a 
virus is an antiviral agent. 

9. The method of claim 8, wherein the POSH-AP is selected from the group 
consisting of: PKA, SNX1, SNX3, PTPN12, GOSR2, CENTB1, ARF1, 
ARF5, PACS-1, EPS8L2, HERPUD1, SMN1, SMN2, UNC84B, MSTP028, 
GOCAP, CBL-B, SYNE1, UBE2N (UBC13), SIAH1, TTC3, WASF1, 
HIP55, RALA, and SPG20. 

10. The method of claim 8, wherein the virus is an envelope virus. 

11. The method of claim 8, wherein the virus is a Human Immunodeficiency 
Virus. 

12. The method of claim 8, wherein the virus is a West Nile Virus. 

13. The method of claim 8, wherein the virus is Moloney Murine Leukemia 
Virus (MMuLV). 

14. The method of claim 8, wherein evaluating the effect of the test agent on a 
function of the virus comprises evaluating the effect of the test agent on the 

( budding or release of the virus or a virus-like particle. 

15. A method of identifying an anti-apoptotic agent, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on apoptosis of a cell, 

wherein an agent that decreases apoptosis of the cell is an anti-apoptotic 
agent. 

16. A method of identifying an anti-cancer agent, comprising: 
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(a) identifying a test agent that disrupts a complex comprising a POSH 
polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on proliferation or survival of a 
cancer cell, 

wherein an agent that decreases proliferation or survival of a cancer cell is an 
anti-cancer cell. 

17. The method of claim 16, wherein the POSH-AP is selected from the group 
consisting of: PKA, SNX1, PTPN12, PPP1CA, ARF1, ARF5, CENTB1, 
EPS8L2, EIF3S3, CBL-B, RALA, SIAH1, TTC3, ATP6V0C, and VCY2IP1. 

18. The method of claim 16, wherein the cancer cell is a cell derived from a 
POSH-associated cancer. 

19. A method of identifying an agent that inhibits trafficking of a protein through 
the secretory pathway, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on the trafficking of a protein 
through the secretory pathway 

wherein an agent that disrupts localization of said POSH-AP is an agent that 
inhibits trafficking of a protein through the secretory pathway. 

20. The method of claim 19, wherein step (b) comprises evaluating the effect of 
the test agent on the trafficking of a myristoylated protein through the 
secretory pathway. 

21. The method of claim 19, wherein step (b) comprises evaluating the effect of 
the test agent on the trafficking of a viral protein through the secretory 
pathway. 
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22. The method of claim 1 9, wherein (b) comprises evaluating the effect of the 
test agent on the trafficking of a protein associated with a neurological 

- • -disorder through the secretory pathway. 

23 . The method of claim 22, wherein the protein associated with a neurological 
5 disorder is amyloid beta precursor protein. 

24. A method of identifying an agent that inhibits the progression of a 
neurological disorder, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
polypeptide and a POSH-AP; and 

I o (b) evaluating the effect of the test agent on the trafficking of a protein 

through the secretory pathway 

wherein an agent that disrupts localization of a POSH-AP is an agent that 
inhibits progression of a neurological disorder. 

25. The method of claim 24, wherein the POSH-AP is selected from the group 
15 consisting of: HERPUD1, CBL-B, SIAH1, and TTC3. 

26. The method of claim 25, wherein the POSH-AP is HERPUD1 . 

27. A method of identifying an agent that inhibits the progression of a 
neurological disorder, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
20 polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on the ubiquitination of a 
protein. 

28. The method of claim 27, wherein the POSH-AP is HERPUD1 . 
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29. A method of treating a viral infection in a subject in need thereof, comprising 
administering an agent that inhibits aPOSH-AP in an amount sufficient to 
inhibit the viral infection. 

30. The method of claim 29, wherein the agent is selected from the group 
5 consisting of: 

i) an agent that inhibits a kinase activity of the POSH-AP; 

ii) an agent that inhibits expression of the POSH-AP; 

iii) an agent that inhibits the ubiquitin ligase activity of the POSH-AP; 

iv) an agent that inhibits the phosphatase activity of the POSH-AP; 
i 0 v) an agent that inhibits the GTPase activity of the POSH-AP; and 

vi) an agent that inhibits the ubiquitination of the POSH-AP. 

31. The method of claim 29, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, SNX1, SNX3, SMN1, SMN2, 
PTPN12, GOSR2, CENTB1, ARF1, ARF5, PACS-1, EPS8L2, HERPUD1, 

15 UNC84B, MSTP028, GOCAP, CBL-B, SYNE1, UBE2N (UBC13), SIAH1, 

TTC3, WASF1, HIP55, RALA, and SPG20! 

32. The method of claim 3 1 , wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, HERPUD 1 , MSTP028, CBL-B, 
andUBE2N(UBC13). 

20 33. The method of claim 32, wherein said agent is selected from the group 

consisting of: an siRNA construct, a small molecule, an antibody, and an 
antisense construct. 

34. The method of claim 33, wherein the agent is an siRNA construct comprising 
a nucleic acid sequence that hybridizes to an mRNA encoding the POSH-AP. 
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35. 

5 36. 
37. 
38. 

10 

39. 

15 

40. 



The method of claim 34, wherein the agent is an siRNA construct or an 
antisense construct that inhibits the expression of a polypeptide selected from 
the group consisting of PKA, HERPUD1, MSTP028, CBL-B, and UBE2N 
(UBC13). 

The method of claim 35, wherein the agent is an siRNA construct or an 
antisense construct that inhibits the expression of HERPUD1 or MSTP028. 

The method of claim 36, wherein the siRNA construct inhibits the expression 
ofMSTP028. 

The method of claim 36, wherein the siRNA construct inhibits the expression 
of HERPUD1 and is selected from the group consisting of: 5'- 
GGGAAGUUCUUCGGAACCUdTdT-3 ' and 5'- 
dTdTCCCUUC AAGAAGCCUUGGA-5 ' . 

The method of claim 33, wherein the small molecule inhibitor is selected 
from among the following categories: adenosine cyclic 
monophosphorothioate, isoquinolinesulfonamide, piperazine, piceatannol, 
and ellagic acid. 

The method of claim 33, wherein the small molecule is selected from among: 



O 
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41 . The method of claim 23, wherein the small molecule inhibits the 
ubiquitination of a POSH-AP. 

42. The method claim 29, wherein the subject is infected with an envelope virus. 

43 . The method of claim 42, wherein the envelope virus is an HTV. 

44. The method of claim 42, wherein the envelope virus is a WNV. 

45. The method of claim 29, wherein the virus is a MMuLV. 
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46. Use of a protein kinase A inhibitor for the manufacture of a medicament for 
treatment of a viral infection. 

47. Use of an inhibitor of HERPUD1 for the manufacture of a medicament for 
treatment of a viral infection. 

48. Use of an inhibitor of MSTP028 for the manufacture of a medicament for 
treatment of a viral infection. 

49. A packaged pharmaceutical for use in treating a viral infection, comprising: 

(a) a pharmaceutical composition comprising an inhibitor of a POSH-AP 
and a pharmaceutical^ acceptable carrier; and 

(b) instructions for use. 

50. The packaged pharmaceutical of claim 49, wherein the viral infection is 
caused by an envelope virus. 

51. A method for identifying an antiviral agent comprising: 

(a) identifying a test agent that inhibits an activity of or expression of a 
POSH-AP; and 

(b) evaluating an effect of the test agent on a function of a virus. 

52. A method of evaluating an antiviral agent comprising: 

(a) providing a test agent that inhibits an activity of or expression of a 
POSH-AP; and 

(b) evaluating an effect of the test agent on a function of a virus. 

53. The method of claim 5 1 or 52, wherein the virus is an envelope virus. 

54. The method of claim 5 1 or 52, wherein the virus is a Human 
Immunodeficiency Virus. 
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55. The method of claim 5 1 or 52, wherein the virus is a West Nile Virus. 

56. The method of claim 5 1 or 52, wherein the virus is a MMuLV. 

57. The method of claim 5 1 or 52, wherein evaluating the effect of the test agent 
on a function of the virus comprises evaluating the effect of the test agent on 

5 the budding or release of the virus or a virus-like particle. 

58. The method of claim 51 or 52, wherein the POSH-AP is selected from the 
group consisting of: PKA, SNX1, SNX3, PTPN12, GOSR2, SMN1, SMN2, 
CENTB1, ARF1, ARF5, PACS-1, EPS8L2, HERPUD1, UNC84B, 
MSTP028, GOCAP, CBL-B, SYNE1, UBE2N (UBC13), SIAH1, TTC3, 

10 WASF1 , HIP55, RALA, and SPG20. 

59. The method of claim 58, wherein the POSH-AP is HERPUD1 . 

60. The method of claim 58, wherein the POSH-AP is MSTP028. 

6 1 . The method of claim 5 1 or 52, wherein the test agent is selected from among: 
an antisense nucleic acid, an siRNA construct, a small molecule, an antibody 

1 5 and a polypeptide. 

62. The method of claim 6 1 , wherein the siRNA construct inhibits the expression 
of HERPUD1 and is selected from the group consisting of: 5'- 

GGG AAGUUCUUCGGAACCUdTdT-3 ' and 5'- 
dTdTCCCUUC AAGAAGCCUUGGA-5 ' . 

20 63. A method of identifying an agent that modulates a POSH function, 
comprising: 

a) identifying an agent that modulates a POSH-AP; and 

b) testing the effect of the agent on a POSH function. 

64. A method of evaluating an agent that modulates a POSH function, 
25 comprising: 
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a) providing an agent that modulates a POSH-AP; and 
- b) testing the effect of the agent on a POSH function. 

65. The method of claim 64 or 65, wherein the POSH-AP comprises a 
polypeptide selected from the group consisting of: PKA, SNXl, SNX3, 

5 ATP6V0C, PTPN12, PPP1CA, GOSR2, CENTB1, DDEF1, AKF17"ARF5, 

PACS-1, EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, EIF3S3, 
SRA1, CBL-B, RALA, SIAH1, SMN1, SMN2, SYNE1, TTC3, VCY2IP1 
andUBE2N(UBC13). 

66. The method of claim 64 or 65, wherein the POSH-AP comprises a 

1 0 polypeptide selected from the group consisting of : ARHV (Chp), WASF 1 , 

HIP55, SPG20, HLA-A, and HLA-B. 

67. The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on the production of 
viral particles or virus like particles in a cell infected with an envelope virus. 

15 68. The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on a POSH 
enzymatic activity. 

69. The method of claim 68, wherein the POSH enzymatic activity is ubiquitin 
ligase activity: 

20 70. The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on POSH-mediated 
localization or secretion of a protein. 

7 1 . The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on the interaction of 

25 POSH with a POSH-AP. 

72. The method of claim 71 , wherein the POSH-AP is a small GTPase. 
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73 . The method of claim 72, wherein the small GTPase is selected from the 
group consisting of: ARF1, ARF5, and RALA. 

74. The method of claim 64 or 65, wherein the test agent is selected from among: 
an antisense nucleic acid, an siRNA construct, a small molecule, an antibody 

5 and a polypeptide. 

75. A method of identifying an agent that modulates a HERPUD1 function, 
comprising: 

a) identifying an agent that modulates POSH; and 

b) testing the effect of the agent on a HERPUD1 function. 

A method of evaluating an agent that modulates an HERPUD1 function; 
comprising: 

a) providing an agent that modulates POSH; and 

b) testing the effect of the agent on a HERPUD1 function. 

The method of claim 75 or 76, wherein testing the effect of the agent on a 
HERPUD1 function comprises contacting a cell with the agent and 
measuring the effect of the agent on ubiquitination of HERPUD1 in the cell. 



.10 76. 



77. 



15 



78. A method of treating a viral infection in a subject in need thereof, comprising 
administering an agent that inhibits MSTP028 in an amount sufficient to 

20 inhibit viral infection. 

79. The method of claim 78, wherein said agent is selected from the group 
consisting of: an siRNA construct, a small molecule, an antibody, and an 
antisense construct. 

80. The method of claim 79, wherein the agent is an siRNA construct comprising 
25 a nucleic acid sequence that hybridizes to an mRNA encoding the MSTP028. 
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81. Amethodof inhibiting an activity of aPOSH-AP in acell, comprising 
contacting the cell with an inhibitor of POSH. 

82. The method of claim 81 , wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, SNX1, SNX3, ATP6V0C, 

5 PTPN12, PPP1CA, GOSR2, CENTB1, DDEF1, ARF1 , ARP5, PACS-1, 

EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, EIF3S3, SRA1, CBL- 
B, RALA, SIAH1, SMN1, SMN2, SYNE1, TTC3, VCY2IP1 and UBE2N 
(UBC13). 

83. The method of claim 81, wherein the inhibitor of POSH is selected from 
10 among the following: 

i) an agent that inhibits a POSH activity; and 

ii) an agent that inhibits expression of a POSH. 

84. The method of claim 83, wherein the POSH activity is ubiquitin ligase 
activity. 

15 85. A method of treating a POSH-associated disease in a subject, comprising 
administering a POSH-AP inhibitor to a subject in need thereof. 

86. The method of claim 85, wherein said POSH-AP inhibitor is an agent 
selected from the group consisting of: 

i) an agent that inhibits a kinase activity of the POSH-AP; 
20 ii) an agent that inhibits expression of the POSH-AP; 

iii) an agent that inhibits the ubiquitin ligase activity of the POSH-AP; 

iv) an agent that inhibits the phosphatase activity of the POSH-AP; 

v) an agent that inhibits the GTPase activity of the POSH-AP; and 

vi) an agent that inhibits the ubiquitination of the POSH-AP. 
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&7. The method of claim 85, wherein the POSH-associated disease is a viral 
infection. 

88. The method of claim 85, wherein the POSH-associated disease is a POSH- 
associated cancer. 

5 89. The method of claim 85, wherein the POSH-associated disease is a POSH- 
associated neurological disorder. 

90. A method of identifying an anti-viral agent, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP and 
a test agent; and 

j0 b) detecting phosphorylation of the POSH polypeptide, 

wherein an agent that inhibits phosphorylation of POSH is an anti-viral 
agent. 

91 . A method of identifying an anti-viral agent, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP, 
1 5 ubiquitin and a test agent; and 

b) detecting ubiquitination of the POSH-AP, 

wherein an agent that inhibits ubiquitination of the POSH-AP is an anti-viral 
agent. 

92. The method of claim 91, wherein the POSH-AP is HERPUD1. 
20 93 . A method of identifying a modulator of POSH, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP and 
a test agent; and 

b) detecting phosphorylation of the POSH polypeptide, 



9372369_1 

- 168 



wherein an agent that alters phosphorylation of POSH is an agent that 
modulates POSH. 

94. A method of identifying a modulator of POSH, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP, 
ubiquitin and a test agent; and 

b) detecting ubiquitination of the POSH-AP, 

wherein an agent that inhibits ubiquitination of the POSH-AP is an agent that 
modulates POSH. 

95. The method of claim 91, wherein the POSH-AP is HERPUD1. 

96. A method of treating or preventing a POSH associated cancer in a subject 
comprising administering an agent that inhibits a POSH-AP to a subject in 
need thereof, wherein said agent treats or prevents cancer. 

97. The method of claim 96, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, SNX1, PTPN12, PPP1CA, 
CENTB1, ARF1, ARF5, EPS8L2, EIF3S3, CBL-B, RALA, SIAH1, TTC3, 
ATPVOC, and VCY2IP1 . 

98. The method of claim 96j wherein the cancer is associated with increased 
POSH expression. 

99. A method of treating or preventing a POSH-associated neurological disorder 
in a subject comprising administering an agent that inhibits a POSH-AP to a 
subject in need thereof, wherein said agent treats or prevents the neurological 
disorder. 

1 00. The method of claim 99, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PTPN12, DDEF1, EPS8L2, 
HERPUD1, GOCAP, CBL-B, SIAH1, SMN1, SMN2, TTC3, SPG20, SNX1, 
and ARF1. 
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101. 
102. 

5 

103. 

10 

104. 
105. 

15 



A method of treating a neurological disorder comprising administering an 
agent to a subject in need thereof, wherein said agent, inhibits the ubiquitin 
ligase activity of POSH. 

A method of treating a neurological disorder comprising administering an 
agent to a subject in need thereof, wherein said agent inhibits the 
ubquitination of a POSH- AP. 

The method of claim 101 or claim 102, wherein the neurological disorder is 
selected from among: Alzheimer's disease, Parkinson's disease, 
Huntington's disease, schizophrenia, Niemann-Pick's disease, and prion- 
associated diseases. 

The use of an agent of claim 103, wherein the neurological disorder is 
Alzheimer's disease. 

The method of claim 101 or claim 102, wherein said agent is selected from 
the group consisting of: an siRNA construct, a small molecule, an antibody, 
and an antisense construct. 

The method of claim 105, wherein the small molecule is selected from 



among: 




CI 
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107. The method of claim 1 02, wherein the POSH-AP is HERPUD1 . 

10 108. The method of claim 6 1 , wherein the siRNA construct inhibits the 

expression of MSTP028 and is selected from the group consisting of: 5'- 
AAGTGCTC ACCG AC AGTGAAG-3 ' and 5'- 
AAG AT ACTT ATG AGCCTTTCT-3 ' . 
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POSH INTERACTING PROTEINS AND RELATED METHODS 

ABSTRACT 

The application provides novel complexes of POSH polypeptides and 
POSH-associated proteins. The application also provides methods and compositions 
for treating POSH-associated diseases such as viral disorders, cancer, and 
neurological disorders. 
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Figure 1: Human POSH Coding Sequence (SEQ ID NGrlTCpart 1) 

ATGGATGAATCAG CCTTGTTGGATCTTTTGGAGTGT CCCSGTGTGT CTAGAGCGCCTTGATG CTTCTGCGA 

CAGATGTCCCGAGTGCAGGACTCTTGTTGGCTCGGGTGTCGAGGAGCTTCCCAGTAAC^TCT 

AGACTTCTGGATGGCATCAAACAGAGGCCTTGGAAACCTGGTCCTGGTGGGGGAAGTGGGACCAACTG^ 

CAAATGCATTAAGGTCTCAGAGCAGCACTGTGGCTAATTGTAGCTCAAA^ 

CGGACAGCAG CCT CGGGTGCAAT CCTGGAGCCCC CCAGTGAGGGGTATACCTCAGTTACCATGTGC^AA 
GCGTTATACAACTATGAAGGAAAAGAG CCTGGAGACCTTAAATTCAGCAAAGGCGACAT CATCATTTTG C 
GAAGACAAGTGGATGAAAATTGGTACCATGGGGAAGTCAATGGAATCC^TGGCT 
TGTGCAGATTATT AAACCGTTAC CTCAGCCCCCAC CT 

GACAAGGAAGCAGACAAAGATTG CCTTCCATTTG CAAAGGATGATGTTCTGACTGTGATCCGAAGAGTGG 

ATGAAAACTGGGCTGAAGGAATGCTGGCAGACAAAATAGGAATATTTCCAATTTCATA 

CTCGGCTG CTAAG CAGCTGATAGAATGGG ATAAGCCTC CTGTG CCAGGAGTTGATGCTCaGAGAATGTTC C 

TCGGCAGCAGCCCAGAGCAGCACTGCCCCAAAGCACTCC 

CCTTCACTTCCCTCACTATGGCCAACAAGTCCTCCCA 

CCCCCCTGTCCTCA.TCAGCTCCAGCAACCCCACTGOTGCTGCACGGATCAGCGA 

TGCAGTGCC C CTT CTCAGGTTCATATAAGTACCAC CGGGTTAATTGTGACCCCGC CCC CAAGCAGCC CAG 
TGACAACTGGCCCCTCGTTTACTTTCCCATCAGATGTTCCCTACCAAGCTGCCCTTGGAACTTTGAATCC 

TCCTCTTC<^CC&CCCCCTCTCCTGGCTGCCAC^ 

GCTGCTGCTGCTGGAATGGGACCGAGGCCCATGGCAGGATCCACTGACCAGATTGCACATT^ 
AGACTCGCCCCAGTGTGTATGTTGCTATATATCCATACACTCCTCGGAAAGAGGATGAACTAGAGCTGAG 
AAAAGGGGAGATGTTTTTAGTGTTTGAGCX3CTGC CAGGATGG CTGGTTCAAAGGGACAT CCATG CATACC 
AGC^GATAGGGGTTTTCCCTTCCAATTATGT^ 

CTAAAGTCCCTATGT CTACAGCTGGC CAGACAAGTCGGGGAGTGACCATGGTCAGTCCTTCCACGG CAGG 
AGGGCCTGCCCAGAAGCTCCAGGGAAATGGCGTGGCTGGGAGTCCCAGTGTTGTCCCCGCAGCTGTGGTA 

TCAGCAGCTCACATCCAGACAAGTCCTCAGGCTAA 
ACCAGGCCCGG^TGCTGTGAGGACAGTTGCAGCGCACAACCAGG 

CATCCAGGTACAGAATGCCGCCGGCCTCAGCCCTGCATCTGTGGGCCTGTCCC^TCACTCGCT 
CCACAACCTGCGC CTCTGATGCCAGGCTC^GCCACGCACACTG CATCAGTATCAGT CGAGCCAGTG 
CCCCTCTGGCCTGTGCAGCAGCTGCrCCACTGACTTCCCCAAGCA 

G CCCAGTGGCCGGATAGTGAC CG TXCTCCCTGGACTC CCCACATCTCCTGACAGTGCTTCZATCAGCTTGT 
GGGAACAGTTCAGCAACCAAACGAGACAAGGATAGCAAAAAAG^ 

CTGGCG C CTC CAC TAAACGGAAG CC C CGCGTGT CT CCTC CAG CATCGCC CAC C CTAGAAGTGGAG C TGGG 
CAGTGCAGAGOTTCCTCTCCAGGGAGCGGTGGGGCCCGAACTGCCACC^GGAGGTGGCCATGGCAGGGC^ 
GGCTCCTGCCCnX5TGGACGGGGACGGACCGGTCACGACTGCAGTGGCAGGAGCAGCCCTGGCCCAGGATG 
CTTTTCATAGGAAGGCAAGTTCCCTGGACTCCGCAGTTCCCATCGCTCCACCTCCTCGCCAGGCCT 
CTCCCTGGGT CCTGTCTTGAATGAGT CTAGACCTGTCGTTTGTGAAAGGCACAGGGTGGTGGTTTCCTAT 
" C CTC CTCAGAGTGAGGCAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTGTTCATAAAAAACGAGAGG 
' ATGGCTGGTT(^^GGCAC^TTAC^CGTAATGGGAAAACTGGCCTTTTCCC^GGAAGCTTTGTGGAAAA 

CATATGA 
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Figure 2: Human POSH Amino Acid Sequence (SEQ ID NO:2) (part 2) 



SAAAQS STAP KHSDTKKNTKKRHS FTSIiTMANKS SQAS QNRHS ME ISP PVLI S S SNPTAAARI SBL SGI»S 
CSAPSQVHISTTGLIVTPPPSSPVTTGPSFTFPSDVPYQAALGTI^ 
AAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPCT^ 
SKIGVPPGNYVAPVTRAVT^QAKTOMSTAGQTSRGVTOT 
SAAHIQTSPQAKVIJiHMTGQMTVNQARNAVRTOAAHirQEROT 

PQPAPLMPGSATHTAAIS ISRASAPLACAAAAPLTSPS ITSASLEAEPSGRIVTVLPGLPTSPDSASSAC 
GNSSATKPDKDSKKEKKGLLKLLSGASTKRKPRVSPPASPTIiEVEIiGSAEIjPIjQGAVGPELPPGGGHGRA 
GS CPVDGDGP VTTAVAGAALAQDAFHRKASSLDSAVP IAPPPRQACS SLGPVIiNE SRP WCERHRWVS Y 
PPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKTGIiFPGSFVENI 
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Figure 3: Human POSH cDNA Sequence (SEQ ID NO:3) 
c^gagacactgcgagcgg^^ 



GC^TGCAC^CflACTRTCaACATTTCTGAA^CTT^CTC. GTCTT GCCTTGC 

sssssssssssssssssss^ 



gcaggactcttgttggctc^gtg^^ 

(^TCAAACAGAGGCCTTGGAAACCTGGTCCTGGTGGGGGAAGTGGGACCAACW 
TCTCAGAGCAG(ScTGTGGCTAATTC 

=^^^= 
^^^^^^^^^^^ 

TCAGGTTCATATAAGTACCACCGGGTTAATTGTGACTC^^ 

SaSg^ga^ 



tSga^g^ag^SSc^ 
gSg^tcctcSctc^^ 

Sgaccg^ctccctcgactccccacatctcctgacagtgcttcatc^^ 
aaccajSccagacaaGgatagcaaaaaag " 

^ScATCACAACGTAATGGGAAAACTCMCCTTTTCCCAGGAAG^ 
GACA^TGAAGAAGCTTAAAATCACTTCACACAACAAAGTAGCACAAAGC^ 

TTGTGGACTTC CAGATGGTCAGGAGATGAG CAAAGGATTGGTATGTGACT CTGATGC CC C^GCACAGTTA 

ATTGTTTACAAGGCTTAACTAATTTATTTGCTTTTTTAAACTTGAACT 
TTGGATTATGATTTTAAGAAATTATT^ 

TGAGAGCAAGAGATT CGTTTTGACATAGAGTGAATG CATTTT CCC CT CT C CTC CTCCCTGCTACCATTAT 

^TGG^TTATGTTTTGCTTCTTTAAGATAGAAATCCCAGTTCTCTAATT^ 

AACCAAACATACAAATGAATCAGTATCAATTAGGGCCTG^^ 

AATGTAT1TAGAAATATATTTAGTTTTATAGCAGAAGCAGCTCAATTGTTTGGTTGGAAAGTAGGGGAAA 

SgtStag^cS 

TGCCCAGGACACAAGTAAAACATTTGTGAGATAGTGGTGGTAAGTGATGCACT CGTGTTAAGT CAAAGGC 

1ATAAGAAACACTGTGAAAAGTTCATATTCATCCATTGTGATTCTTTCCCCACGT 

G^TTCCCA^GTAATATAGACTGTGCATGGTGTGTATATCT 

ctScagaattcaccaato 



-to be continued 
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Figure 3: Human POSH cDNA Sequence (SEQ ID NO:3) 

CTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCT^ 

TTAGTGAAAGTGGTCTTTTATGTCCTCCCAGCAGACAGACATCAAGGATGAGT^ 

CTGTGACTGTGGAGCTCTGGAAGGCTTGGTGGGAGTGAATTTGCCCACACCTTACAATTGTGGCAGGATC 
CAGAAGAGCCTGT CTTTT TAT AT C CATT C CTTGATGT CATTGGC CT CTCCCACCGATTTCATTACGGTGC 
CACGC AGT CATGG AT CTGGGTAGT C CGGAAAACAAAAGGAGGGAAGACAGCCTGGTAATGAATAAGAT C C 
TTACCACAGTTTTCT CATGGGAAATACAT AATAAACC CTTTCAT CTTTTTTTTTTT C CTTTAAGAATTAA 
AACTGGGAAATAGAAACATGAACTGAAAAGTCTTGCAATGAC^GAGGTTTCATGGTCTT 

TC CTGTGTGTTGAATTTAAAAAAAAAAAATACTT TACTTGGATATTCATGTAATATATAAAGGTTTGGTG 
AAATGAACTTTAGTTAGGAAAAAGCTGGCAT CAGCTTTCAT CTGTGTAAGTTGACAC CAATGTGT C ATAA 
TATTCTTTATTTTGGGAAATTAGTGTATTTTATAAAAATT^ 

GATAATTTTTTTACC TGT CTTTT CT C CAT ATTTTAAGCT ATG TGAT TGAAGTAC CT CTGTT CATAGTTT C 
CTGGTATAAA.GTTGGTTAAAATTTCATCTGTTAATAGAT CATTAG GTAATATAATGTATGGGTTTT CTAT 
TGGT T TTT TG CAGACAGTAGAGGGAGATT TTGTAACAAGGG C TTGTTACACAGTGATATGGTAATGATAA 
AATTG CAATTTAT CACTC CTTTT CATGTTAATAAT TTGAGGACTGGATAAAAGGT TT CAAGAT TAAAATT 
TGATGTTCAAACCTTTGT 
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Figure 4: 5' cDNA fragment of human POSH (public gi:1043261 1; SEQ ID NO:4) 

ctgagagacactgcgagcggcgagcgcggtggggccgcatctgcatcagccgccgcagccgctgcggggc 

cgcgaacaaagaggaggagccgaggcgcgagagcaaagtctgaaatggatgttacatgagtcattttaag 

gatgcacacaactatgaacatttctgaagattttttctcagtaaagtagataaagatggatgaatcagcc 

ttgttggatcttttggagtgtccggtgtgtctagagcgccttgatgcttctgcgaaggtcttgccttgcc 

agcatacgttttgcaagcgatgtttgctggggatcgtaggttctcgaaatgaactcagatgtcccgagtg 

caggactcttgttggctcgggtgtcgaggagcttcccagtaacatcttgctggtcagacttctggatggc 

atcaaacagaggccttggaaacctggtcctggtgggggaagtgggaccaactgcacaaatgcattaaggt 

ctcagagcagcactgtggctaattgtagctcaaaagatctgcagagctcccagggcggacagcagcctcg 

ggtgcaatcctggagccccccagtgaggggtatacctcagttaccatgtgccaaagcgttatacaactat^ 

gaaggaaaagagcctggagaccttaaattcagcaaaggcgacatcatcattttgcgaagacaagtggatg 

aaaattggtaccatggggaagtcaatggaatccatggctttttccccaccaactttgtgcagattattaa 

accgttacctcagcccccacctcagtgcaaagcactttatgactttgaagtgaaagacaaggaagcagac 

aaagattgccttccatttgcaaaggatgatgttctgactgtgatccgaagagtggatgaaaactgggctg 

aaggaatgctggcagacaaaataggaatatttccaatttcatatgttgagtttaactcggctgctaagca 

gctgatagaatgggataagcctcctgtgccaggagttgatgqtggagaatgttcctcggcagcagcccag 

agcagcactgccccaaagcactccgacaccaagaagaacaccaaaaagcggcactccttcacttccctca 

ctatggccaacaagtcctcccaggcatcccagaaccgccactccatggagatcagcccccctgtcctcat 

cagctccagcaaccccactgctgctgcacggatcagcgagctgtctgggctctcctgcagtgccccttct 

caggttcatataagtaccaccgggttaattgtgaccccgcccccaagcagcccagtgacaactggcccct 

cgtttactttcccatcagatgttccctaccaagctgcccttggaactttgaatcctcctcttccaccacc 

ccctctcctggctgccactgtccttgcctccacaccaccaggcgccaccgccgccgctgctgctgctgga 

atgggaccgaggcccatggcaggatccactgaccagattgcacatttacggccgcagactcgccccagtg 

tgtatgttgctatatatccatacactcctcggaaagaggatgaactagagctgagaaaaggggagatgtt 

tttagtgtttgagcgctgccaggatggctggttcaaagggacatccatgcataccagcaagataggggtt 

ttccctggcaattatgtggcaccagtcacaagggcggtgacaaatgcttcccaagctaaagtccctatgt 

ctacagctggccagacaagtcggggagtgaccatggtcagtccttccacggcaggagggcctgcccagaa 

gctccagggaaatggcgtggctgggagtcccagtgttgtccccgcagctgtggtatcagcagctcacatc 

cagacaagtcctcaggctaaggtcttgttgcacatgacggggcaaatgacagtcaaccaggcccgcaatg 

ctgtgaggacagttgcagcgcacaaccaggaacgccccacggcagcagtgacacccatccaggtacagaa 

tgccgccggcctcagccctgcatctgtgggcctgtcccatcactcgctggcctccccacaacctgcgcct 

ctgatgccaggctcagccacgcacactgctgccafccagtatcagtcgagccagtgcccctctggcctgtg 

cagcagctgctccactgacttccccaagcatcaccagtgcttctctggaggctgagcccagtggccggat 

agtgaccgttctccctggactccccacatctcctgacagtgcttcatcagcttgtgggaacagttcagca 

. accaaaccagacaaggatagc 
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Figure 5: N terminus protein fragment of hPOSH (public gi: 10432612; SEQ ID 
NO:5) 

MDESALLDLLECPVCLEIUiDASAKV^PCQ 

IUjLDGIKQRPWKPGPGGGSGTNCTNALRSQSSTVANCSSKDLQSSQGGQQPRVQSWSPPVRGIPQIjPCAK 
AIiYNYEGKBPGDLKPSKGDXI ILRRQVDENWYHGEVNGIHGFFPTNPVQI IKPLPQPPPQCKAJjYDFEVK 
DKEADKD CL P FAKDD VLTVI RRVDENWAEGMLADKIG I FP I S YVE FNS AAKQL I E WDKPP VPGVD AGE CS 
SAAAQSSTAPKHSOTKKNTKKRHSFTSLTMANKSSQASQNRHSMEISPPV 
CSAPSQVHISTTGLIVTPPPSSPVTTGPSFTFPSDVPYQAALGTIOT 
AAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAlYPYTPRKEDELEIiRKGE^ 

SKI GVFP GNYVAP VTRAVTNASQAKVPMSTAGQTSRG VTMVS P STAGGPAQKLQGNGVAGS PS WPAAW 
SAAHIQTSPQAKVLLHOTGQMTVNQARNAVRTVAAHNQERPTAAVTPIQVQN^ 

PQPAPLMPGSATHTAAISISRASAPIACAAAAPLTSPSrTSASLEAEPSGRIVTVLPGLPTSPDSASSAC 
GNSSATKPDKDS 



I 



( 
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Figure 6: 3' mRNA fragment of hPOSH (public gi:7959248; SEQ ID NO:6) 

atttcatatgttgagtttaactcggctgctaagcagctgatagaatgggataagcctcctgtgccaggag 
ttgatgctggagaatgttcctcggcagcagcccagagcagcactgccccaaagcactccgacaccaagaa 
gaacaccaaaaagcggcactccttcacttccctcactatggccaacaagtcctcccaggcatcccagaac 
cgccactccatggagatcagcccccctgtcctcatcagctccagcaaccccactgctgctgcacggatca 
gcgagctgtctgggctctcctgcagtgccccttctcaggttcatataagtaccaccgggttaattgtgac 
cccgcccccaagcagcccagtgacaactggcccctcgtttactttcccatcagatgttccctaccaagct 
gcccttggaactttgaatcctcctcttccaccaccccctctcctggctgccactgtccttgcctccacac 
caccaggcgccaccgccgctgctgctgctgctggaatgggaccgaggcccatggcaggatccactgacca 
gattgcacatttacggccgcagactcgccccagtgtgtatgttgctatatatccatacactcctcggaaa 
gaggatgaactagagctgagaaaaggggagatgtttttagtgtttgagcgctgccaggatggctggttca 
aagggacatccatgcataccagcaagataggggttttccctggcaattatgtggcaccagtcacaagggc 
ggtgacaaatgcttcccaagctaaagtccctatgtctacagctggccagacaagtcggggagtgaccatg 
gtcagtccttccacggcaggagggcctgcccagaagctccagggaaatggcgtggctgggagtcccagtg 
ttgtccccgcagctgtggtatcagcagctcacatccagacaagtcctcaggctaaggtcttgttgcacat 
gacggggcaaatgacagtcaaccaggcccgcaatgctgtgaggacagttgcagcgcacaaccaggaacgc 
cccacggcagcagtgacacccatccaggtacagaatgccgccggcctcagccctgcatctgtgggcctgt 
cccatcactcgctggcctccccacaacctgcgcctctgatgccaggctcagccacgcacactgctgccat 
cagtatcagtcgagccagtgcccctctggcctgtgcagcagctgctccactgacttccccaagcatcacc 
agtgcttctctggaggctgagcccagtggccggatagtgaccgttctccctggactccccacatctcctg 
acagtgcttcatcagcttgtgggaacagttcagcaaccaaaccagacaaggatagcaaaaaagaaaaaaa 
gggtttgttgaagttgctttctggcgcctccactaaacggaagccccgcgtgtctcctccagcatcgccc 
accctagaagtggagctgggcagtgcagagcttcctctccagggagcggtggggcccgaactgccaccag 
gaggtggccatggcagggcaggctcctgccctgtggacggggacggaccggtcacgactgcagtggcagg 
agcagccctggcccaggatgcttttcataggaaggcaagttccctggactccgcagttcccatcgctcca 
cctcctcgccaggcctgttcctccctgggtcctgtcttgaatgagtctagacctgtcgbttgtgaaaggc 
acagggtggtggtttcctatcctcctcagagtgaggcagaacttgaacttaaagaaggagatattgtgtt 
tgttcataaaaaacgagaggatggctggttcaaaggcacattacaacgtaatgggaaaactggccttttc 
ccaggaagctttgtggaaaacatatgaggagactgacactgaagaagcttaaaatcacttcacacaacaa 
agtagcacaaagcagtttaacagaaagagcacatttgtggacttccagatggtcaggagatgagcaaagg 
attggtatgtgactctgatgccccagcacagttaccccagcgagcagagtgaagaagatgtttgtgtggg 
ttttgttagtctggattcggatgtataaggtgtgccttgtactgtctgatttactacacagagaaacttt 
tttttttttttaagatatatgactaaaatggacaattgtttacaaggcttaactaatttatttgcttttt 
taaacttgaacttttcgtataatagatacgttctttggattatgattttaagaaattattaatttatgaa 
atgataggtaaggagaagctggattatctcctgttgagagcaagagattcgttttgacatagagtgaatg 
cattttcccctctcctcctccctgctaccattatattttggggttatgttttgcttctttaagatagaaa 
tcccagttctctaatttggttttcttctttgggaaaccaaacatacaaatgaatcagbatcaattagggc 
ctggggtagagagacagaaacttgagagaagagaagttagtgattccctctctttctagtttggtaggaa 
tcaccctgaagacctagtcctcaatttaattgtgtgggtttttaattttcctagaatgaagtgactgaaa 
caatgagaaagaatacagcacaacccttgaacaaaatgtatttagaaatatatttagttttatagcagaa 
gcagctcaattgtttggttggaaagtaggggaaattgaagttgtagtcactgtctgagaatggctatgaa 
gcgtcatttcacattttaccccaactgacctgcatgcccaggacacaagtaaaacatttgtgagatagtg 
gtggtaagtgatgcactcgtgttaagtcaaaggctataagaaacactgtgaaaagttcatattcatccat 
tgtgattctttccccacgtcttgcatgtattactggattcccacagtaatatagactgtgcatggtgtgt 
atatttcattgcgatttcctgttaagatgagtttgtactcagaattgaccaattcaggaggtgtaaaaat 
aaacagtgttctcttctctaccccaaagccactactgaccaaggtctcttcagtgcactcgctccctctc 
tggctaaggcatgcattagccactacacaagtcattagtgaaagtggtcttttatgtcctcccagcagac 
agacatcaaggatgagttaaccaggagactactcctgtgactgtggagctctggaacfgcttggtgggagt 
gaatttgcccacaccttacaattgtggcaggatccagaagagcctgtctttttatatccattccttgatg 
tcattggcctctcccaccgatttcattacggtgccacgcagtcatggatctgggtagfcccggaaaacaaa 
aggagggaagacagcctggtaatgaataagatccttaccacagttttctcatgggaaatacataataaac 
cctttcatctttttttttttcctttaagaattaaaactgggaaatagaaacatgaactgaaaagtcttgc 
aatgacaagaggtttcatggtcttaaaaagatactttatatggttgaagatgaaatcattcctaaattaa 
ccttttttttaaaaaaaaacaatgtatattatgttcctgtgtgttgaatttaaaaaaaaaaaatacttta 
cttggatattcatgtaatatataaaggtttggtgaaatgaactttagttaggaaaaagctggcatcagct 
ttcatctgtgtaagttgacaccaatgtgtcataatattctttattttgggaaattagtgtattttataaa 
aattttaaaaagaaaaaagactactacaggttaagataatttttttacctgtcttttctccatattttaa 
gctatgtgattgaagtacctctgttcatagcttcctggtataaagttggttaaaatttcatctgttaata 
gatcattaggtaatataatgtatgggttttctattggttttttgcagacagtagagggagattttgtaac 
aagggcttgttacacagtgatatggtaatgataaaattgcaatttatcactccttbtcatgttaataatt 
tgaggactggataaaaggtttcaagattaaaatttgatgttcaaacctttgt 
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Figure 7: C terminus protein fragment of hPOSH (pubUc gi:7959249; SEQ ID 
""NO:7) 

" I S YVE FNSAAKQL I EWDKPP VPGVDAGE CSSAAAQSSTAPKHSDTKKNTKKRHS FTSIiTMANKSSQASQN 
RHSWEISPPVLISSSNPTAAARISELSGLSCSAPSQVHISTTGIiIVTPPPSSPVTTGPSPTFPSDVPYQA 

ALGTLNPPLPPPPLLAATVIiASTPPGATAAAAAAGM^ 
EDEIiELRKGEMFLVFBRCQDGWFKGTSMHTSKIGVFPGNYVAPV^ 

VS PSTAGGP AQKLQGNGVAGS PS VVPAAWS AAHI QTS PQAKATLLHMTGQMTVN^ R 
PTAAVTP I QVQNAAGLSPASVGLSHHSIiAS PQPAPLMPGSATHTAAI S I SRASAPLACAAAAPLTS PS I T 
SASLE AE PSGRI VT VLPGLPTSPDSAS SACGNS SATKPDKDS KKE KKGLIiKIiIj SGASTKRKPRVSP PAS P 
TLE VELGSAE LPLQGAVGPELP PGGGHGRAGS CP VD GDGPVTTAVAGAAIiAQD AFHRKASS LDS AVP I AF"~ 
PPRQACSSLGPVIiNESRPWCERHRVWSYPPQSEAEI^L^GDI 
PGSFVENI 
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Figure 8: Human POSH full mRNA, Annotated Sequence (part 1) 



- gi| 10432611 | dbj | AK021429.1) AK021429 Homo sapiens CDNA 

FLJ11367 fis, clone HEMBA1000303 , highly similar to Mus musculus 
Plenty of SH3s (POSH) mRNA 

- g i| 7959248 | dbj | AB040927 . 1 | AB040927 Homo sapiens mRNA for 

KIAA1494 protein, partial cds 

- Both hPOSH and KIAA1495 

- Ring Domain 

- SH3 Domian 

- start codon and stop codon of predicted ORF 

CTGAGAGACACTG CGAGCGGCGAGCG CGGTGGGG CCGCATCTG CATCAGCCGC CGCAG CCGCTGCGGGGC 
CG CGAACAAAGAGGAGGAG CCGAGGCGCGAGAG CAAAGT CTGAAATGGATGTTAGATGAGT CATTTTAAG 
GG AT GC A.C ACAACTATG AACATTTCTGAAG ATTT TT 
CTTGTTGGATCTTTTGGAG 



GCAGGACT CTTGTTGGCTCGGGTGTCGAGGAGCTTC C CAGTAACATCTTG CTGGT CAGACTTCTGGATGG 
CAT CAAACAG AGG CC TTGGAAAC CTGGT C C TGGTGGG GGAAGTGGGA C CAACTGCAC AAATG CATT AAGG 
T CT CAGAGCAG CACTGTGG CT AAT TG TAG CT CAAAAGAT CTG CA GAG CTC C CAGGG CGGACAG C AG C CT C 
GGGTGCAATCCTGGAGCCCCCCAGTGAGGGGTATACCTCAGTXfi ~ 




\GAAAAAAAGGGTTTGTTGAAGTTGCTTTCTGGCGCCTCCACT 
i^CGGAAGCCCCGCGTGTCTCCTCCAGCATCXSCCCACCCTAGAAGTGGAGCTGGGC^GTGCAGAGCTTC 
CTCTCCAGGGAGCGGTGGGGCCCGAACTG CCACCAGGAGGTGGCCATGGCAGGGCAGG CTC CTGCC CTGT 
GGACGGGGACGGACCGGTCACGACTGCAGTGGC^AGGAGCAGCCCTGGCCCAGGATGCTTTTCATAGGAAG 
GCAAGTTCCCTGGACTCCGCAGTTCCCATCGCTCCACCTCCTCGCCAGGCCTGTTCCTCCCTC 
TCTTGAATGAGTCTAGACCTGTCGTTTGTE * 




____ gGGAGACT 

GACACTGAAGAAGCTTAAAATCACTTCACACIAACAAAGTAGCAC^ 



-to be continued 
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Figure 8: Human POSH full mRNA, Annotated Sequence (part 2) 



TTGTGGACTT CCAGATGGTCAGG AGATGAGCAAAGGATTGGTATGTGACT CTGATGCCCCAGCACAGTTA 
CCCCAGCGAGCAGAGTGAAGAAGATGTTTGTGTGGGTTTTGTTAGTCTGGATTCGGATGTATAAGGTGTG 
C CT TGTACTGTCTGATTTACT ACACAGAGAAACTT XT TTTTT TTTTTAAGATATATGA CTAAAATGGACA 
ATTGTTTACAAGGCTTAACTAATtrTATTTGCTTTTT^ 

TTGG ATT ATGATT TT AAG AAATT ATTAATTT ATGAAATGATAGGTAAGGAGAAGCTGG AT T AT CTC CTGT 
TGAGAGCAAGAGATTCGTTTTGACATAGAGTGAATG CATTTTCCCCT CTC CTCCT CCCTGCTACCATTAT 
ATTTTGGGGTTATGTTTTGCTTCTTTAAGATAGAAATCCCAGTTCTCTAATTT 

AAC CAAACATACAAATGAATCAGT AT CAATTAGGG C CTGGGGTAGAG AGA CAG AAACTTGAG AGAAG AGA 
AGTTAGTGATTCC CT CTCTTT CTAGTTTGGTAGGAATCACCCTGAAGACCTAGTCCTCAATTTAATTGTG 
TG GGTTT TTAATTTT C CT AGAATG AAGTGACTGAAAC AATGAGAAAG AAT ACAG CAC AAC C CTTGAACAA 
AATGTAT TTAGAAATATATTTAGTTTT AT AGCAG AAGCAGCT CAATT GTT TGGTTGGAAAGTAGGGGAAA 
TTGAAGTTGTAGTCACTGTCTGAGAATGGCTATGAAGCGTCATTTCAC^ 

TGCCCAGGACACAAGTAAAACATTTGTGAGATAGTGGTGGTAAGTGATGCACTCGTGTTAAGTCAAAGGC 
TATAAGAAACACTGTGAAAAGTT CATATT CATCCATTGTGATTCTTT CCCCACGT CTTGCATGTATTACT 
GGATTCCCACAGT AATATAGACTGTGCATGGTGTGTATATTTCATTG CGATTT CCTGTTAAGATGAGTTT 
GTACTCAGAATTGACCAATTC^GGAGGTGTAAAAATAAACAGTGTTCTCTT^ 

CTGAC CAAGGT CT CTTCAGTG CACTCGCT CCCTCTCTGG CTAAGG CATGCATTAG CCACTACACAAGT CA 

TTAGTGAAAGTGGTCTTTTATGTCCTCCCAGCAGA<^GACATCAAGGATGAGTTAACCAGGAGACT 

CTGTGACTGTGGAGCTCTGG2VAGGCTTGGTGGGAGTGAATTTGCCCACACCTTACAATTGTGGCAGGATC 

CAGAAGAGCCTGTCTTTTTATATCCATTCCTTGATGT CATTGGCCT CTCC CACCGATTTCATTACGGTGC 

CACG CAGT CATGGAT CTGGGT AGT C CGG AAAACAAAAGGAGGGAAGA CAG C CTGGTAATGAAT AAGAT C C 

TTAC CACAGTTT T CT CATGGGAAATA CATAATAAAC C CTTT CAT C TTTTT TTT TTT C CTTT AAGAATT AA 

AAOTGGGAAATAGAAACATGAACTGAAAAGTCTTGCAATGACAAGAGGTTTCATGGTCT^ 

TT TATATGGTTGAAGATGAAAT CATT C CTAAATTAAC CTTTT TTT TAAAAAAAAA CAATGTATATT ATG T 

TCCTGTGTGTTGAATTTAAAAAAAAAAAATACTO^ 

AAATGAACTTT AGTTAGGAAAAAG CTGG CATCAG CT TT CATCTGTGT AAGTTGACACCAATGTGTCATAA 
TATT CTTTATTTTGG GAAATTAGTGT ATTTTAT AAAAAT TTT AAAAAGAAAAAAG ACTACTACAGGTTAA 
GATAATTTTTTTAC CTGT CTTTT CTC CAT ATTTTAAGCTATGTGATTGAAGTAC CT CTGTT CATAGTTT C 
CTGGTATAAAGTTGGTTAAAATTT CATCTGTTAATAG AT CATTAGGT AAT ATAAT GTATGGGTTTT CTAT 
TGGTTTTTTG CAG ACAGTAGAGGGAG ATTTTGT AACAAGGG CTTGTT ACACAGTGAT ATGGT AATGATAA 
AATTG CAATTTAT CACTC CTTTT CATGTTAATAATTT GAGGACTGG ATAAAAG GT TTCAAGAT TAAAATT - 
TGATGTTCAAACCTTTGT 
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Figure 9: Domain Analysis of Human POSH 
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Figure 10: Diagram of Human POSH Nucleic Acids 
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Figure 1 1 : Reduction in Full Length POSH inRNA by siRNA Duplexes 
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Figure 12: POSH Affects Release of VLP from Cells 



(WT) (Posh+WT) 
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VLP 
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Figure 13: Release of VLP from Cells at Steady State 




12 3 4 
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Figure 14: Mouse POSH mRNA sequence (public gi:10946921; SEQ ID NO: 8) 

GGGCAGOGGGCTCQGCGGGGCTGCATCTACCAGCX3CTGCG6GGCOT 
GCGAGAGCAAAGTCTGAAATGGATGTTAGATGAATCACTTT^ 
AAGCCGTTTTCTCACTAAAGTCACTCAAGATGGATGAGTCTG CCTTGTTGGA 

GTGTCTAGAACG C CTGGATG CTT C CG CAAAGGT CTTA C C CTG CCAG CATAC CTTTTGCAAACG C TGTTTG 
CTGGGGATTGTGGGTTCCCGGAATGAACTCAGATGTCCCGAATGCCGGACTCTTGTTGGCTCTGGGGTCG 
ACG AG CT CC C CAGTAACAT C CTACTGGT CAGACTT CTGGATGGCATC AAG CAGAGGCCTTGGAAAC CCGG 
CCCTGGTGGGGGCGGCGGGACCACCTGCACAAAC^CATTAAGGGCGCAGGGCA 

GG CTCG AAAGAT CTG CAGAG CT C C CAGTGTGGACAG CAG CC T CGGGTGCAAGCCTGGAGCC CC C CAGTGA 
GGGGAATAC CT CAGTTAC CGTGTG C CAAAGGA.TT ATATAACT ACGAAGGAAAAGAGC C CGGAGAC CTTAA 
GTTCAGCAAAGG CGACACCATCATTCTGCGCCGACAGGTGGATGAGAATTGGTAC CACGGGGAAGTCAGC 
GGGGTCCACGGCTTTTTCCCCACTAACTTCGTGCAGATCATCAAACCTTTAC 
GCAAAGCACH!TTACGACTTTGAAGTGAAAGACAAGGAA 

CGACGTACTGACCGTGAT CCGCAGAGTGGATGAAAACTGGGCTGAAGGAATGC TGGCAGATAAAATAGGA 
AT AT TT C CAAT T T CATACGTGGAGT TTAA CT CAG CTG CCAAG CAG CT GAT AGAGTGGGATAAG C CT CC CG 
TGCCAGGAGTGGACACGGCAGAATGCCCCTCAGCGACGGCGCAGAGC^VCCTCTGCCTCAAAGCACCCCGA 

CACCAAGAAGAACACCAGGAAGCGACACTCCrTCACCTCCCTCACC^ 

TCCCAGAACCGCCACT CCATGGAGAT CAGCCCTCCTGTG CTCATCAGTTC CAGCAACC CCACAGC CGCAG 
CCCGGATCAGCGAACTGTCCGGGCTCTCCTGCAGCGCCCCGTCTCAGGTCCAT^ 

AATTGTGACCCCACCCCCTAGCAGCCCGGTGACAACTGGCCCTGCGTTCACGTTCCCTTCAGATGTCCC^ 

TACCAAGCTGCCCTTGGAAGTATGAATCCTCCACTTpCCCC^CCCCCTCTCCrGGCGGCCACCG 

CCTCCACCCCGTCAGGCGCTACTGCTGCTGTTGCTGCTGCTGCTGCCGCCGCCGCCGCTGCTGGAATGGG 

ACCCAGGCCTGTGATGGGGTCCTCTGAAC^GATTGCACATTTACGGC CTCAGACT CGT CCCAGTGTATAT 

GTTGCTATATATCCGTACACTCCCCGGAAGGAAGACGAACTGGAGCTGAGGAAAGGGGAGATGTTTTTGG 

TGTTTGAGCGTTGCCAGGACGGCTGGTACAAAGGGACAT CGATGCATACCAGCAAGATAGG CGTTTTCCC 

TGGCAACTATGTGGCGCCTOTCACAAGGGO^TGACGAATGCCTCCCAAGCTAAAGTCT 

GCGGGT CAGG CAAGTCGCGGGGTGACCATGGTCAGCCCTTCCACTGCAGGAGGAC CTACACAGAAGCCC C 

AAGGAAACGGCGTGGCCGGAAATCCCAGCGTCGTCCCCACGGCTGTGGTGTCAGCAGCTCATATCCAGAC 

AAGT C CT CAGG CT AAGGT C CTGCTG CACATGT CTGGGCAGATGACAGT CAAT CAGGC C CGCAATG CTGTG 

AGGACAGTTG CAG CACAT AG CCAGGAACG CC C CACAG CAGCAGTGACT C C CAT C CAGGTCCAGAATGC CG 

CCTGCCTTGGTCCTGCATCCGTGGGCCTGCCCCAT(^TTCT 

GGGTCOTGCTGCCCACGGTGCTGCCXSTCAGC^TCAGTCGAACCAATGCCCCCATGGCCT 

GCTTCTCTGGCCrCCCCAAATATGACCAGTGCCATGTTGGAGACAGAGCCC^GTGGTCGC^ 

TCCTCCCTGGACTCCCCACATCTCCAGAGAGTGCTGCATCA^ 

AGACAAGGAC AGT AAGAAAGAAAAAAAGGGC CTACTG AAG CTG CT TT CTGGTG C CT C CAC CAAACG CAAG 

CCCCGAGTCTCCCCTCCAGCATCACCTACCCTGGATGTGGAGCTC 

GAGCAGTAGGTCCTGAGCTGCCGCTAGGGGGCAGCCACX3GCAGAGTG 

TGGTCCAGTGGCCGCTGGAACAGCAGCCCTAGCCCAGGATGCCOTCCACCGCAAGACAAGCTCCCT 
TCCGCAGTGCCCATTGCTCCACCACCTCGCC^GGCCTGCTCCTCCCTGGGCCCA 

GGCCTGTTGTTTGTGAAAGGCACAGGGTGGTGGTT TC CTACC CTC CT CAGAGTGAGGCCGAACTTGAACT 
CAAGG AAGGAGATATTGTGTTTGTT CATAAGAAACGAGAGGACGG CTGGT T CAAAGG CACGTT ACAGAGG 
AATGGGAAGACTGGCCTTTTCCCAGGGAGCTTTGTGGAAAACATCTGAGAAGACGGGACACGGAGAAAGC 
TTATCATCACACCACGTGTGACTAAAGAG CACAAAGCAGTTT CATAGAAAGAG CACAT CTGTGGACTTC C 
AGATCTTCAAGAACCGAGCAGAAGATGGGGACCTGACTCCAGAGCC^ 

AGGGAAGGAGGACACACCTGTGTGGGTTC CGTCT CTCTGGGTTCTGATGTGTAAAGTGTGCCTTGTAATG 
TCTAATGGACTTTACAGATAAAT GT CTTTTTTTT TTT AAGATGTATAACT AAAAT GGACAATT GTTTACA 
AGGCTTAACTAATTTATTTGCTTTTTTAAAACTTGAACTTTCTTGTAATAGCAAAT 
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Figure 15: Mouse POSH Protein sequence (Public gi: 10946922; SEQ ID NO: 9) 

MDESALLDLLECPVCLERLDASAKVIiPCQHTFCKRCLLG 
.^LDGIKQRPWKPGPGGGGGTTCTNTLRAC^STVW^ 

ALYNYEGKEPGDLKFSKGDTI ILRRQVDENWYHGEVSGVHGFFPTNFVQI IKPLPQPPPQCKALYDFEVK 
DKEAD KD CLP FAKDD VLTVI RRVDENWAEGMIiADKIG I FP I S YVE FNS AAKQL IE WDKP PVPGVDTAECP 
SATAQSTSASKHPDTKKNTRKRHSFTSIiTMANKSS 

CSAPSQVHISTTGLIVTPPPSSPVTTGPAFTFPSDVPYQAAIjGSMNPPIjPPP 
VAAAAAAAAAAGMGPRPVMGSSEQIAHLRPQTRPSVWAIYP 

KGT SMHTS KI GVFP GNYVAP VTRAVTNAS QAKVS MSTAGQ AS RGVTMVS P S TAGG PTQKP QGNG VAGNP S 
WPTAWSAAHIQTSPQAKVLLHMSGQMTVNQAIU^AVRTVAAHSQERPTAAVTPIQVQNAACLGPASVGL 
PHHSLASQPIjPPMAGPAAHGAAVS I SRTNAPMACAAGAS LAS PNMTS AMLETE PSGRTVTI IiPGLPTSPE"" 
SAASACGNSSAGKPDKDSKKEKKGLLKLIiSGASTKRKPRVSPPASPTIiDVELGAGEAPLQGAVGPELPLG 
GSHGRVGS CPTDGDGP VAAGTAAIiAQDAFHRKTS SLDSAVP I APP PRQACSSLGP VMNEARP WCERHR V 
WSYPPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKTGIiFPGSFVENI 
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Figure 16: Drosophila melanogaster POSH mKNA sequence (public gi:17737480; 
SEQIDNO:10) 



CATTTGTATCCGCn?TGGCCACGAGCTTTGGCTGCACTTGGCAAACT 

CCTATTGCAACGATAATATAATCTGATTTAGTGCATTAAGAACGACAAGTAGCGATTATAATAGTAGATT 
TTAGCATTTGAGCTAAATTTATTTCC CAACCGCGTCTTGGGATTGCGTATGCGTGAGC CAGTACCTGCAT 
GTGTGTGTGTTTTGGAATGTG G C CC TG CACGAAATTCAAATAGTGACCAT C CTTG AGATTTTG CATACTG 
G CAAGATGGACGAGCACACGT TAAACGAC CTGTTGGAGTGCT C CGTGTGT CTTGAG CGACTGGACACCAC 
ATCGAAGGTGCTGCCATGCCAGCACACCTTCTGCCGCAAATGCTTGCAGGA^ 

AAGTTGCGATGCC CGGAGTGCCG CAT CCTGGTCTCTTGC AAAATTGATGAGCTGCCTCCAAACGTCTTGC 
TGATGCGAAT CTTAGAAGG CATGAAACAAAATGCAG CAG CTGG CAAAGG AGAAGAAAAGGGAGAGGAGAC 
TGAAACACAGCCGGAAAGGGCCAAACCTCAGCCGCCAGCGGAATCAGTGGCCCCGCCTGAC^CC^CTA 
CTCCAGCTGCAGTCACATCAGCAATCTCATCAGCCGGCTCGTCACT^ 

ACGCCTATGCCCTCTTTGACTTCGCCTCCGGTGAAGCCACCGATCTAAAGTTCAAGAAAGGGGATCTGAT 
ACTGATCAAGCATCGCATCGACAACAACTGGTTTGTGGGT 

ATCAA CTACGTCAAGGTATCG GTTC CGCTGC CCATGCCG CAGTGCATTGCCATGTATGACTTTAAGATGG 
GGCCCAACGACX3AGGAGGGATGC CT CGAATTTAAGAAAAG CACTGTAATACAGGTAATG CGCCGAGTTGA 
T CAT AATTGGGCAGAAGG ACGAATTGG C CAGACCAT CGG AAT CTT T C CAATAG CATT C GTTGAG CTG AAT 
GCAGCGGCCAAAAAG CTGTTGGACAG CGGGCTACACACCCATCCATT CTG CCATC CAC CGAAGCAACAGG 
GGCAGCGGGCCCTTCCTCCGGTTCCAGTTATTGATCCC^CGGTGGTCACGGAATCCAGTTCGGGATC 
CAATTCCACGCCGGGCAG CAGCAATTCAAGCTCCACATC CAGCTCGAATAACTGCAGT C CGAATCACCAA 
ATCTCACTGC CGAATACCC CC CAACATGTAGTAGCTT CCGG ATCGGCGTCTGTTCGTTTCCGTGACAAGG 
GAGCAAAGGAGAAACGCCACT CACTAAATGCTTTGCTGGGAGGAGGAGCTCCATTAAGT CTGCTGCAGAC 
CAACCGC CATTCKK3CTGAAATTCTTAGCCTGCCCCATGAACTAAGCCGCT TGGAAGTTTC C AGCTCAACA 
G C T CT AAAACC CACGT CAGC C CCACAGACAT CG CGTGTACTTAAGACCA CTGTT CAGCAG CAG ATGCAAC 
CG AATTT AC C CTGGGGATACT TAG C C CTGTT C C C ATACAAAC CACG C CAAACGGATGAG CTGG AATTAAA 
AAAGGGTTGTGTTTACATTGTGACOSAACGATGTGTGGACGGTTGGTTC^GGGAAAAAACTGGTTGGAC 
ATCACTGGAGTGTTCCCGGGGAACTACCTGACGCCCCTGCGCGCC^ 

AATGGAAATATGTT C CCCAAAATGCAGACGCC CAGATGG CACAAGTACAGCAGCATCCAGT TGCACCAGA 

TGTGCGACTGAACAACATGCTGTCCATGCAACCGCCTG 

ACGACCACCAGTTGCTCTGTGTGGTCGAAACCAGTGGAGGCGCTGTTC^ 

CTGAAACTGC CACAGC TT CGA CTACG AG CAG CAGTTC CT CTGGAGCAGTGGGACTTATGAGGAGAT TAAC 
T CACATGAAAACACG CTC CAAAT CT C CGGGAG CGTC C TTGCAGCAAGTT C CGAAAGAAGCTATTAG CACA 
AATGTGGAATTTACAACAAACCCATCAGCTAAATTGCATCC^ 

GTCAGCTGCAGCACAGTCAAC CG CTCAATGAAACTCCAG CAGCCAAGACAGCGGCACAACAACAGCAGTT 
CCTAC CCAAG CAG CTGC C TT C CG CTT CTA CG AACAGCGT T T CGTACGGATCG CAACGCGTG AAAGGAAG C 
AAGGAACGTCCTCACTTGATTTGCGCGAGACAATCATTAGATGCAGCTACATTTCG 

ATGCCGCGTCGCCGCCGCCACCTACTACTTCCGTGGCCCCAGCTGTCTACGCCGGCGGTCAGCAACAGGT 
GATTCCTGGAGGTGGAGCGCAATCCCAGTTG CATGCCAATATGATTATTG CACC CAGC CATCGGAAGT CG 
CACAG CCT AGATGCGAGT CAT GTGCTGAGT CCCAG CAG CAAT ATG AT CACGGAGG CGGC CATTAAGG C CA 
GCGCCACCACTAAGTCTCCTTACTGCACGAGGGAAAGTCGATTCCGCTC 

CAGTG ACATTGAACTAGAG CTACATTTGGG CGACATT AT CT A CGT CCAGCGGAAG CAGAAG AACGG CTGG 

TATAAGGGCACCCATGCCCX3TACCCACAAAACCK3GGCTGTTCCCCG 

AGGAAAGTTATGGTTCAAACTAGAATTTATTAAGCGAAATTCCAAATTACTT^ 

GTCGGTCTATTC<3GGCTTCCAAATACGCAATCTCATATTTCTCTTT^ 

CTTCCAATCGAATGGGCAGCTCGCCGTTGTACTTTTOT 

TAAGACTTAGGGAACAGTT AC TT AAG C CTTAG CGATTAGTTAGCTAGAGAAATAATCTAAC CG AT C CTTG 
TGCCCTCTACAAAGTTATTTGTAATATACGATACTCkGT^ 
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Figure 1 7: Drosophila melanogaster POSH protein sequence (public gi: 1773748 1 ; 
SEQIDNO:ll) • 

MDEHTLNDLLECSVCLERIJOT 

RILEGMKQNAAA.GKGEEKGEETETQPERAKPQPPAESVAPPDNQI»LQLQSHQQSHQPARHKQRRPLIiPHA 
YAIjFDFASGEATDIjKPKKGDIjIIj I KHRIDNNWFVGQANGQEGTFP INYVKVSVPLPMPQCIAMYDFKMGP 
NDEEGCLEFKKSTVIQWRRVDHNWAEGRIGQTIGI^^ 

RALPPVPVIDPTVVTESSSGSSNSTPGSSNSSSTSSSNNCSPNHQISIiPNTPQHVVASGSASVRFRDKGA 
KEKRHSI*NALLGGGAPLSLLQTNRHS AE I LSL PHEL SRIiEVS S STALKPTSAPQTSRVLKTTVQQQMQPN 
LPWGYIiALFP YKPRQTDEIiEIjKKGCVYIVTERCVDGWFKGKNWIiD I TGVFPGNYLTPLRARDQQQI*MHQW 
KYVPQNADAQMAQVQQHPVAPDVRlJlINMLSM 

TATASTTSSSSSGAVGLMRRLTHMKTRSKSPGASMQVPKEAISTNVEFTT^ 

I^HSQPLNETPAAKTAAQQQQFIiPKQLPSASTNSVSYGSQRVKGSKERPHIiICARQSLDAATFRSMYl^ 
ASP PPPTTSVAPAVYAGGQQQVI PGGGAQSQLHANMI IAPSHRKSHSLDASHVLS PS SNMI TEAAI KASA 
TT KS P YCTRE SRFRC I VP YP PNSD I E IjEIjHLGDI I YVQRKQKNGWYKGTHARTHKTGIiFPAS FVE PDC 
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Figure 18: POSH Domain Analysis 

. ^ hPOSH protein sequence : 
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C terminus protein fragment of hPOSH (public gi:7959249): 
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Mouse POSH Protein sequence (Public gi: 10946922): 
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Drosophila melanogaster POSH protein sequence (public gi:i773748i) 
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Figure 19: Human POSH has ubiquitin ligase activity 
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Figure 20 
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Figure 22. 
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Figure 25. 
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FIGURE 26 
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• .... Figure 27. 
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Figure 28. 
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Figure 29A. 
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Figure 29B. 
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Figure 30. Putative PKA phosphorylation sites in hPOSH. 
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Figure 31. Phosphorylation of hPOSH regulates binding of OTP-loaded Rac-1 . 
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FIGURE 36 

Unigene Name; Arfl Unigene ID; Hs. 286221 

Human Arfl raRNA sequence - varl (public gi: 3360490) (SEQ ID NO: 325) 

GCAAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTC 
CAAGCATGGGGAACATCTTCGCCAACCTCTTCAAGGGCCTTTTTGG 

GGTGGGCC^3GATGCTGCAGGGAAGACC^CGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCACC 

ATTCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAG 

GTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAAC^ 

GGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAG 

CTCCGGGATGCTGTGCTCCTGGTGTTCGCCAAC^^ 

TCACAGACAAGCTGGGGCTGCACTCACTACGCCAGAGGi^ 

CGGCGACGGGCTCTATGAAGGACTGGACTGGOTGTCCAATCAGCTCCGG/^CCAGAAGTGAACGCGACCC 
CCCTCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTC7VTGTGGCAAACGTGCGGCTCGTGGTGTGAGTG 
CCAGAAGCTGCCTCCGTGGTTTGGTC^CCGTGTGCATCGCACCGTGCTGTA^ 
GCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGOT 

ATATTACTCAGCTTTTTTTATTGTAAAAAGAAAAATCAACTCACTGTTCAGTGCTGAGAGGGGATGTAGG 
CCCATGGGC^CCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCC^CGCCCTTGGCTTTAGAGCTGT 
GTTGAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTAAGAATCCA 
AGTCGAGAACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGC 
PVGTCGCCAGCCCAGOTGTTCCCCTCGGGAACAT^ 

GCATGGTCACAGTAGAGATCCCCGC^CTCGCTTGTCCTTGGGT(^CCCTG<^FTCCATAGCCATGTGCT 
TGTCCCTGTGCTCCCAGGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCC 
GCCCTACCCACCrTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTCCOTCGGTCGCCGTGTGGCCAGA 
GTGGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCC 
* GGAGCGCGTGCAAGQCGGGCAGGCGGTCC^CCTAGACCCACAGCCCCTCGGGAGC^CCCCACCTCTGTGT 
GTGATGTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACTTTT 
TTCTTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTT^ 

CTATTTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAAAAAAAAAA 
AAAAAAA 

Human Arfl niRNA sequence - var2 (public gi: 3 0583624) (SEQ ID NO: 326) 

ATGGGGAACATCTTCGCCAACCTCTTCAAGGGCCTTXlTGGCAAAAAAGAAA 

GCC^GATGCTGC^GGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCACCATTCC 
CACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATC^ 

CAGGAG^GATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTCGTGG 

GCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCTCCG 

GGATGCTCTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCA^^ 

GACAAGCTGGGGCTGCACTCACTACGCCACAGQAACTGGTACA^ 

ACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTAG 

Human Arfl mRNA sequence - var3 (public gi: 34527605) (SEQ n>NO:*327) 

AAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTGGCCAGTGTCCTTCCACCTGTCCACA 
AGCATGGGGAACATCTTCGCCAACCTCTTC^GG 
TGGGCCTGGATG.CTGC^GGGAAGACC^CGA.TCCTCTACAAGOT 
' • TCCCACCATAGGCraCAACGTGGAAACCGTGGAGTACAAGAA^ 
GGCCAGGAC/bvGATCCGGCCCCTGTGGCGCCACTACTTC 

AGAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTC^TGAGGATGCTGGCCGAGGACGAGCT 
CCGGGATGCTGTCCTCCTOGTGTTCGCC^CAAGCAGGACCTCCCCAACGCC^TGAATGCGGCCGAGATC 
ACAGACAAGCTGGGGCTGCACTCACTACGCCyvCAGGAACTGGTAC^TTCAG 

GCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACGCGACCCCC 
CTCCCTCTCACTCCTCTTGCCCTCTGCTTT^^ 

AGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGC 
GGCC^GGCTTTTTATTTAATGTAAATAGTTTTTO 

ATTACTCAGCTTTTTTATTGTAAAAAGAAAAATC^CTCACTGTTCAGTGCTGAGAGGGGATGTAGGCCC 

ATGGGCACOTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTO 

GAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTA 

CGAGAACACTTGAACA(^CAGAAGGGAGACCCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGCCAG 

TCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGCA 

TGGTCACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACCCTGCATTCCATAGCCATGTGCTTGT 
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rrnTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCC 
cSSScCTTCAG^CAGCCTATGGGACGCAGG^CCCATCTGTCCCTCGGTCGCCGTG 

ATGTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACTTTTTTC 
TTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCT 

TTTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAACTGTTTTGTATACTTGTTTT 
C^GTTTTCIATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATAAAATCTCrTAACTATTT 




ATATXACTG- — 
CCCATGGGCACCTTOCCTCCAGGAGTCGCT^ 

tcga^^ 

GTCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGC 

ATGGTCACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGT^ . 

TCCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGC 

CCTACCCACCTTCAGGCAGCCTATGGGACGCAGGCCCCATCTGTCCCTCGGTCCGCGTGTGGCCAGAGTG 

GTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGC 

CGC^TG'CAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGGGAGCACCCCACCTCTGTCT 

TGTAGCTTTCTCTCCCTCftGCCTGCAAGGGTCCGATTTGCCATCXaAAAA 

TTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTAT 
TTGGTGTCGTAGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAACTGTTTTGTATACTTGTTTT 
CAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATAAAATCTCTTAACTATT 

Human Arfl nxRNA sequence - var5 (public gi: 7020834) (SEQ ID NO: 329) 

CCTTACCCGGCGTGCCCCGCGCCCGGAGGCGCTGACGTGGCCGCCGTCAGAGCCGCCATCTTGTGGGAGC 
AAAACC^CGCCTGGCTCGGAG(^GCAGCCTCTGAGGTGTCCCTGGCCAGTGTCCTTCCACCTGTCCA^ 
AGCATGGGGAACATCTTCGCCAACCTCT^ 

TGGGCCTGGATGCTGCAGGGAAGACCAGGATCCTCTACAAGCT^ 
TCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTAC^ 

GGCCAGGAC^^GATCCGGCCCCTGTGGTOCCACTACTTCCAGAACACACAAGGCCT 

ACAGCAATGAC^GAGAGCGTGTGAACXSAGGCCCGTGAGGAGCTCZATGAGGATGCTGGCCGAGGACGAGCT 
CCGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCT 

ACAGACAAGCTGGGGCTGCACTCACTACGCCACAGGAACTGGTACATTCAGGCCACCTGC^ 

GCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAAT 

CTCCCTCTCACTCCTCTTGCCCTCTC 

AGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGC 

GGCCAGGCTTTTTATTTAATGTAAATAGTTITTGTTTCCAATGAGGCAGTTTCTGGTAOT 

ATTACTCAGCTTTTTTTATTGTAAAAAGAAAAATCAACT 

CATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGC 

TGAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATT 

TCGAGAAC^CTTGAACACACAGAAGGGAGACCCCGC 

GTCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATC^TTCTGC 

ATGGTGACAGTAGAGATCCCCGCAACTCGCTTG 

TCCCTGTGCTCCCACGGTTCCCAGGGGCC^GGCT^ 

CCTACCCACCTTCAGGC^GCCTATGGGACGCAGGGCCCCATC1X3TCCCTCGGTCGCCGTGTGGCCAGAGT 
GGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCCTGACCAGCAGG 
AGCGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCT 

GATGTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACrTTT^ 
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ATTTGGTGTCGTGGAACCTCTTACTGCTTTCAATAC 

TCAGTTTTCATTTCGACAAAC7^GC^CTGTAATTATAGCTATTAGAATAAAATCTCTTAACTATTAAAAA 
AAAAAAAAAAAAAAAAAAAAA 

Human Arfl niRNA sequence- var6 (public gi: 10435849) (SEQ ID NO: 330) 

AGCTCAGTGCCCAGCATGTCTGTGGTGAGTGTGTAGTTCAG^ 

^ CTCTCTTCCTGGGTTCTTGC^CTC^ 

GTGTGCGCTGACCATGCTGAGATGGGCACTGTGGACTCAGCCTCTGGTCATTGCTGGAACCAGCGGCCTC 

CATGTGAGGTACAGGGGAACGCACTGCTAdCAGATGGTTGGGATGTGGACACTCGTCCTGCCCTCTTGGC 

TTGGTGCTGTGCCATCGCAC^GTCATTCGCTGTTTAGCATGCATGGGAGAGAGTGAAGCACAAGGGCC^ 

GGCCCCTGGGAGTGCCTGCCCTCAATTTGGAAGAGCCCTTGGGCACAGCATAGGCGCCTGGCAGAATTGG 

ACTGGGCCATGATCCAGGGCATTGGGACCTCACCTAGGAGTTGGGGTTC^GGTCA 

CAGGGTCTCCCCTGTGGGCACaU^CTGACerCAAACTGCrGGTTCTTTOT 

TGAAGTACTCTCCCGGCAGCTGTCACCTGCAGGGAGAGGTGGGGGTAGGGGTGCTGTGTTTCTTAGCTGT 
TCCTCGTTGCAGTGTTAAATCCCTGCAGGTTCC^ 

GCTAATGTGGGTTTGCGTTTTTGGTCTTGGTTTTCCCAGTGGCCAGTCCATCAGCCACTGCACTGGGGGC 
CAGGTAGAGGCCAACTGCACCCTGCCTGCCAGAGTAGAAATACTGGTAGGCCCCAGGCTCTGCTGCCCCT 
TCCATGTCCTTGTGTAAGCATCCATGGACAAAGCTGACTCACGGGGTGTGCACAGCTGCAGGGAGGCCAG 
GAAACAGGGGTTTTATTCTAGAGGGCCTTGTGCTCAGTGACAGACCAGAGTCCCATCACTGAGAGAGCAG 
GGCTGGGGCAGCACAAAGGACTGGATAGCATTTGCCATGATGCCATGTGCACAGCCAGTGCAAGTCCTTC 
ATTGTAGCTGTGGTGAGAGGTCATGAGACACTGCCTTCAGCAGCCCTGGGA 

AGAGCTGTGCATCTGCAGATTTCAGAAGGACTTACGTTTGGTGAGGTGCTTTGAAGTAACACTTCACAAA 
TACCAAGAAGCAAGAAATACACAAATAAGCAGGTAATGGTTOTTTGGTGTTTACATTAGCTAGTGG 
CGGTTCTTTGGTGTTC^CATTAGCTATAGTCCCAGAACTCAGTCCATGAGGTGGAATCAC^AAAATGGA^ 
TTCATTTCTGGCTGTGAGTACACA^CTGATTTAAGATATCACCTTGAATTTTAAGCTGACAAACAGTGA 
TCTAAACTGAATTTCACTGATTGCCCCACCTGAAAGTCAGACCTGATAGATAATGCCCTCCCTTAACTCA 
7VGGCCAGCAGCAGATGTGTTAGAGGGGACCCTTGTGCCTCGCAGCCCTCATCTCCTAATGGCTGTGGGGT 
CACTGTGTCGAGTTGTAATGCCTAATGAGCTCCTCTAAAAACATCCTGAAACTTGTGTAAAAAAACAGCA 
GACTCCCAGTGGAACTCGCCTTCAGATGCAGCCC^GAATAAGAGTTCTAGAATGTGTGTGCCATCCTTTT 
. GTCTCAATCTGCATGATTGCAAGTCTCTTCAACATGATTGGGTGCGTGGAGTGTCTCGGTCATGTGTCTT 
CCCCTCTGAGCATGCCTTTTGATTCGCACCTGTGTCACAATTGTGCCAGCCTGTGAGATGTGTCTGCCTG 
TCACCAGTATCGGCACATTTAGTTTTCCCTTTACGTGAGTTTT^ 

GTGCTCAGTCACAGAAAAATGTCAGGCCTACAGAAATGGAGCATTTGGCTGGTGGGTAGCGTGATGACCA 
TAGGCTTTATTTGGCTGGTGTGGTAAACAAGCAGCAGCTTGTGCAGGTGAGAATAAATGGCCATATTGCA 
TTTCATTTTAAGGACTCCCTTAAAATGAAAATCTT 
ATCATCTACACTATATCTATGACTGTTGAAAGGCT^^ 

GTACATGAGTCCCTTCAGGAAGTCATCAGCTTTGTTCAGTTTCCTCAGATTAGATAGTAAAACTGAGATT 

ATGAACTATAAAGATGTGTGTAATTTATCTGTCAGTGAACTTGACTTTAATAAAAGCTTTTTGAAAAAGA 

ACTCTGGGTGGGGTGCATTGGCTCACACACATAGTCCCAACTACTGTGGAGGTCAGGGCAGGAGGATCAC 

TGGAGCCCAAGAGTTCAAGATCAGCCTGGGCAGGATAGCGAGACCCTGTCTATAGAAAATATTAAAAATC 

AGCTAGGCATGGTGGCTTGCCCTTGCATTCCCTGCCACTTGGGAGGCTGAGGTGGGAGGTTCGCT 

CCAGGAGCTCAAGGCTGCAATGGGCTGTGATCGAACCACTGAATTCCAACCTGGGTGACAGAGTGAGGCC 

CTGTCTCAAAAAGAGAACTCTCGATGTCACTGGCTTTCC^TGTAAGC^GAGCACATCATGTGAGCCCavT 

TCGTGGATGTCAGTCAGCAGAACAGAATCTTGGXCCTGGAGCTTG 

TGTCTCTGTCTTTCTGTTGGTTCCTGTCAGTTCAGGTCACTT^ 

ACAGGTTGGGGGAGATGATCAACCTTCCGCCTGCGCCTGTT^ 

TGAGATCTGCAAGGGACCGAGGACAGTACTGdCTGGTGGTCTGGGTACAGGCCACAGAGGCATCTGGACC 
CCATGTGCATCTGGACCAGTTTGGTTGGATCCATTCATGGACACAAAACGGATGTGAACTC^ 
CATTTTCTCCCTGCCCCTGTTCAGGCACAGTGAGGTGTCGGGGAATGTAGGTGCCAG^ 
GTTCTTTGGTGTAATGCCTGAAGGCCACCTTTACCA 

CTTCCTAAAAGACTTTTTTTCCTCAGAGTTGGAGCCCACAGCGTGGTCAGGAAAGAGAAGTAGCCACTGG 

TGGCTCCTGGCATCCTCCTGCTGGGCAGCCCCTTCTCAAAGTGTGAGGGGTCCCCTTGTGTAGAAGCAGG 

AAGGCTCTGAGAAAGTCAGGTTTGCTCCTACCACAGGATAATTCCGATGAACCTGAAAAGCGGGTTTTGG 

CTTGTGTGCAGGGACTCTGGTGGAAGAAAGGGTGACAGCACCTGGCCTGGGCATGACACAA 

CGTACCAAGAGGCCCTGGAATTGAGGGTGGGGGTTGCTGTGGACTCTTTCTCCCTCTTAGGAAACTCTAT 

TGGCjTCTCCATCTGTCACAGA&GCAGTAAATGATGTA 

TGGAAGATGCCGAGGCAGGACGTGCCAGCCACCCTCTGCCCATATGTGCAGCAGGGCC^ 
GTCGGTAGGAGAGACCAAGCTGTCTGTGTGCCGATGTCTTGACACCTGAGACTTCAGGTTC^ 
GGTTCTGCCATTCCATTGCAGGGTGGCTTCCCTCCTTTC 
AAAAAAAAAAAAAAATCCGGGCGTGGTGGCTCACTCCTGTAATCC 

CTGATCATCTGAGGTCAGGGGTTCGAGGCCAGCCCTGACCAACATGGTGAAACCCCGTCTCTACT 
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Human Arfl raRNA sequence - var7 (public gi: 14714S85)(SEQ ID NO: 331) 

^S^TCTTCC^CAACCTCTTCAAGGGCOT 
CTGGATG^ 

GGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGA 

AATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCTCCGGG 
AXGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGA 

CAAGCTGGGGCTGCACTCACTACGCCACAGGAACTGGTACATTCAGGCCACCTGCG 

GGGCTCTATGAAGGAOTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACGCGACCCCCCTCCC 
TCTCACTCCTCTTGCCCTCTGCTTTACTCTCATGTGGCAAACGTGCGGCTCGTC 

CTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGCGGCCA 

GGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGG 
TCAGCTTTTTTTATTGTAAAAAGAAAAATCAACTCACTGOT 

GCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGTGTTGAAA 
TCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTAAGAATCCAAGTCGAG 
AACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGCCAGTCGC 
CAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGCATGGT 
CACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACCCTGCATTCCATAGCCATGTGCTTGTCCCT 
GTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCCCTAC 
CCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTCCCTCGGTCGCCGTGTGGCCAGAGTGGGTC 
CGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCCTCACCAGCAGGAGCGC 
GTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGGGAGCACCCCACCTCTGTGTGTGATGT 
AGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACTTTTTTCTTTT 
GTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTATTTG 

GTGTCGTGGAACCTCTTACTGCTTTGAATACACGATTAGT^ 

TTTCATTTCGAGAAACAAGCACTGTAATTATAGCTAl^ 

AAAAAAAAAAAAAAAAAAAAA 

Human Arfl roRNA sequence - varS (public gi: 33872952) (SEQ ID NO: 332) 

GTCCAATCAGCTCCGGAACCAGAAGTGAACGCGACCCCCCTCCCTCTCACTCCTCTTGCCCTCTGCTTTA 
CTCTCATGTGGCAAACGTGCGGCTCGTGGTGTGAGTGCCAGAAGCTGCCTCCGTGGTTTGGTCACCGTGT 
GCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGCGGCCAGGCT^ 

TTGTTTCCAATGAGGCAGTTTCTGGTACTCOTATGCAATATTACTCAGCTTTTTTTATTGTAAAAAGAAA 
AATCAACTCACTGTTCAGTGCTGAGAGGGGATGTAGGCCCATGGGCACCTGGCCTCCAGGAGTCGCTGTG 
TTGGGAGAGCCXK5CCACGCCCTTGGCTTTAGAGCTGTGTTGAAATCCATTTTGGTGGTTGGTTTTTAACC 
CAAACTCAGTGCATTTTTTAAAATAGTTAAGAATCCAAGTCGAGAACACT^ 

CCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGCCAGTCGCCAGCCCAGCTGTTCCCCTCGGGAACA 

TGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGCATGGTCACAGTAGAGATCCCCGCAACTCGCT 

TGTCCTTGGGTCACCCTGCATTCCATAGCCATGTGCTTGTCCCTGTGCTCCCACGGTTCCCAG 

GCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCCCTACCCACCTTCAGGCAGCCTATGGGACGC 

AGGGCCCGATCTGTCCCTCGGTCGCCGTGTGGCCAGAGTGGGTCCGTCX3TCCCCAACACTCGTGCTCGCT 

cagacactttggc^ggatgtctggggcctcaccagcaggagcgcgtgcaagccgggcaggcggtccacct 
agaccc^cagcccctcgggagcaccccacctctgtgtgtgatgtagctttctctccctcagcctgcaagg 
gtcc^Atttgccatcgaaaaagac^cctctact^ 

gctgttaaatttatcttggggaaacctcagaactggtctatttggtgtcgtggaacctcttact 

aatagacgattagtaatcaactgttttgtatacttgttttc^ 

attatagctattagaataaaatctcttaactattaaaaaaaaaaaaaaaaaa 

Human Arfl mRNA sequence - var9 (public gi: 15030200) (SEQ ID NO: 333) 

GAGCCGCCATCTTGTGGGAGCAAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTGGCCA 
GTGTCCTTCCACCTGTCCACAAGCATGGGGAACATCT 

AAGAAATGCGCATCCTCATGGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTOTACAAGCTT7UVGCT 
GGGTGAGATCGTGACCACCATTCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAGC 
TTCACTGTGTGGGACGTGGGTGGCCAGGAC^^^ 

AAGGCCTGATCTTCGTGGTGGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAG 
GATGCTGGCCGAGGACGAGCTCCGGGATGCTGTCCTCCTGGTGTTC^ 

GCCATGAATGCGGCCGAGATCACAGACAAGCTGGGGCTGCACTCACTACGCC^CAGGAACTGGTACATTC 

AGGCCACCTGCGCCACCAGCGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAA 

CCAGAAGTGAACGCGACCCCCCTCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTCATG 

GCGGCTCGTGGTGTGAGTGCGAGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTA 

AATGTGGCAGACGCAGCCTGCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAG 



Figure 36 part -4 



P C TV" U S O Nk/ O 6 3 O 8 

TTTCTGGTACTCCTATGCAATATTACTCAGCTT^^ 

TGCTGAGAGGGGATGTAGGCCC^TGGGC^CCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGG 

CCCTTGGCTTTMAGCTGTOTTOAAATCC^ 
TAAAATAGTTAAGAATCCAAGTCGAGAACACTT 
GC^GTTACGGCCTGGATGCCAGTCGCC^CCC^^ 
GCAGACTGCGATCAATTCTGCATGGTCACAGTAGAGATCCCCG^ 

CATTCCATAGCCATGTGCTTGTCCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCA 
CCCC^CTATGCCGCAGGCCGCCCTACCC^ 

CGGTCGCCGTGTGGCCAGAGTGGGTCCGTCGTCCCG^CACTCGTGCTCGCTCAGACACTTTGGCAGGAT 
GTCTGGGGCCTCACCAGCAGGAGCGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGG 

G3U3CAGCCCAGCTCTGTGTGTGATGTAGCTTTCTCT 

AAAGACAACCTCTACTTTTTTCTTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTG 
GGGAAACCTCAGAACTGGTCTATTTGGTGTCX3TGGAACCTCTTACTGCTTTCAATACACGATTAGTAATC 
AACTGTTTTGTATACTTGTTTTCAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATA 

AAATCTCTTAAOTATTAAAAAAAAAAAAAAAAAA 

Hiunan Arfl raRNA sequence - varlO (public gi: 16553 846) (SEQ ID NO: 334) 

GTGGGAGCAAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTGGCCAGTGTCCTTCCACC 
TGTCCACAAGCATGGGGAACATCTTCGCCAACCTCTTtlAAGGGCCTTTTTGGCAAAAAAGAAATGCGCAT 
CCTCATGGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTG 
ACCACGATTCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTAG^ 

ACGTGGGTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTT 
CGTGGTGGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAG 
GACGAGCTCCGGGATGCTGTCCrrCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCCTVTGAATGCGG 
CCGAGATCACAGACAAGCTGGGGCTGCACTCACTACGCCACAGGAACTGGTACATTCAGGCCACCTGCGC 
CACCAGCGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACG 
.CGACCCCCCTCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTCATGTGGCAAACGTGCGGCTCGTGGTG 
TGAGTGCCAGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACG 
CAGCCTGCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAGTTTCTGGTACTCC 
TATGCAATATTACTCAGCTTTTTTTATTGTAAAAAGAAAAATCAACTCACTGTTCAGTGCTGAGAGGGGA 
TGTAGGCCCATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAG 
AGCTGTGTTGAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTAAG 
AATCOUIGTCGAGAACACTTGAAC^CACAG 

GGATGCCAGTCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATC 

AATTCTGCATGGTC^C^GTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACCCTGC^ 

TGTGCTTGTCCCTGTGCTCCC^CGGTT'CCCAGGG 

CAGGCCGTCCTACCCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTCCCTCGGTCGCCGTGTG 

GCCAGAGTGGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCCTCA 

CCAGCAGGAGCGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGGGAGCACCCCACCT 

CTGTGTGTGATGTAGCTTTCTCTCCCTCAGCCTGC^GGGTCCGATTTGCCATaSAAAAAGACy^CCTCT 

ACTTTTTTCTTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGA 

ACTGGTCTATTTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAACTGTTTTGTAT 

ACTTGTTTTCAGTTTTCATTTCGAGAAACy^GCACTGTAATTATAGCTATTAGAATA 

ATT 

Human Arfl niRNA sequence - varl 1 (public gi: 16553799) (SEQ ID NO: 335) 

AACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTGGCC^ 

CATGGGGAAC^TCTTCGCCAACCTCTTCAAGGGCCTTTTTGGCAAAAAAGAAATGCGCATCCTCATGGTG 
GGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCACCATTC 
CC^CCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACAT 
CCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACJ^ 

AGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCTCC 
GGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCCATGAATGCGGCCGAGATCAC 
AGACAAGCTGGGGCTGCACTCACTACGCCACAGGAACTGGTACATTCAGGCCACCTGCGCCACCAGCGGC 
GACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACGCGACCCCCCT 
CCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTCATGTGGCA^ 

AAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGCGG 

CCAGG<mTTTATTTAATGTAAATAGTTTTTC 

TACTCAGCTTTTTTTATTGTAAAAAGAAAAATCAACTCACT 

TGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGC 

AAATCCATTTTGGTGGTTGGTTTTTAACCCAAAC^CAGTGCATTTTTT 

GAGAACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATTTGCAGTTACGGCOTGGATG 
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CGCCAGCCC^GCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATC^TT^CAT 
CCTGTGCTCCCACGGTTCCC^GGGGCCAGGCTGGGAGCCC^C^GCCACCCCACT^ATGCOT^TCCOT^C 

^ TTTGTATOT^GATAAACACTGAAGCTGGAG 

TTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATC 

Human Arfl mRNA sequence - varl2 (public gi: 20147654) (SEQ ID NO: 336) 

^G^ 

GCCTGGATGCTGCAGGGAAGACCACGATC CT CTACAAG ^^^^^ TG ^^ TGA ^^ ^^^^^33^ 
CACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACAT 

CAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTC^ 
GGAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGOTC 

GGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCC^TGAATGCGGCCGAGA 

GACAAGCTGGGGCTGCACT»CTACGCCA(^GGAACTGGTACATTCAGGCCACCTGCGCCACCAGCGGCG 

ACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGA 

Human Arfl mRNA sequence - varl3 (public gi: 178163) (SEQ ID NO^ 337) 

AAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTGGCCAGTGTCCTTCCACCTGTCCACAA 
GCATGGGGAACATCTTCGCCAACCTCTTCAAGGGCCTTTTTGGCT^AAAAGAAATGCGCATCCTCATGGT 
GGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCACCATT 
CCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAGCTTCACTGTGTGGGACGTGGGTG 
GCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTCGTGGTGGA 
- CAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCTC 
CGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCCATGAATGCGGCCGAGATCA 
CAGACAAGCTGGGGCTGCACTCACTACGCCACAGGAACTGGTACATTCAGGCCACCTGCGC 
CGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACGCGACCCCCC 

TCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTC 

GAAGCTGCCTCCGTGGTTTGGTCACCGTGTGC^TCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGCG 

GCCAGGCITTTTATTTAATGTAAATAGTTTTTGTT^ 
TTACTCAGCTTTTTTTATTGTAAAAAGAAAAAT 

ATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGTGTT 

GAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTAA 

CGAGAACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGCCAG 

TCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGCA 

TGGTCACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACCCTGCATTCCATAGCCATGTGCTTGT 

CCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCC^ 

CTACCCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTCCCTCGGTCGCCGTGTGGCCAGAGTG 

GGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCCTCACCAGCAGGA 

GCGCGTGCAAGCCGGGC^GGCGGTCCACCTAGACCCACAGCCCCTCGGGAGCACCCCACCTCTGTGTGTG 

ATGTAGCTTTCTCTCCCTC^GCCTGC^ 
TTTTGTATTTTGATAAAC^CTGAAGCTGGAGCTGTT 

TTTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAACTGTTTTGTATACTTGTTT^ 
CAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATAAAAT 

Human Arfl mRNA sequence - varl4 (public gi: 178982) (SEQ ID NO: 338) 

GGGGAAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCCCTGGCCAGTGTCCTTCCACCTGTC 
CACAAGCATGGGGAACATCTTCGCGAACCTCTTG^ 

ATGGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTAGAAGCTTAAGCTGGGTGAGATCGTGACC^ 
CCATTCCCACCATAGGCTT(^CGTGGAAACCGTGGAGTA(^GAACATCAGCTTCACTGTGTGGGACG^ 
GGGTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTCGTG 
GTGGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACG 

AGCTCCGGGATGCTGTCCTCCTGGTGTTCGCCAAC^ 

GATCACAGACAAGCTGGGGCTGCACTCACTACGCC^CAGGAACTGGTACATTCAGGCCACCTGCGCCACC 
AGCGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACGCGAC 

CCCCCTCCCTCTCACTCCTCTTGCCCTCTGC 

TGCCAGAAGCTGCCTCCGTGGITTGGTCACCGTGTGCATCGC^CCGTGCTGTAAATGTGGCAGA 

CTGCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAGTTTCTGGTACTCCTATG 

CAATATTACTC^GCTTTTTTTATTGTAAAAAGAAAAATCTVACTCACTG 
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^CCTACckcOT^ 

tcgSotStccccaacactcgtgctcgct 



CTTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACr 
ATTTGGTGTCGTAGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAACTGTTTTGTATACTTGTT 
TTCAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATAAAATCTCTTAACTATT 

HtraaiiArfl mRNA sequence - var!5 (public gi: 3005720) (SEQ ID NO: 339) 

AAACCAACGCCTGGCTCGGAGCAC^GCCTCTGAGGTGTCCCTGGCCAGTGTCCTTCCACOT 

GCATGGGGAACATCTTCGCCAACCTCTTCAAGGGCCTTTTTGGCAAAAAAGAAA 

GGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGOTGGGTGAGATCG^ 

CCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAGCTTCACTGTGTGGGACGTGGGTG 

GCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTCGTGGTGGA 

CAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCTC 

CGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCCATGAATGCGGCCGAGATCA 

CAGACAAGCTGGGGCnXSCACTCACTACGCCACAGGAACTC 

CGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACC7VGAAGTGAACGCGACCCCCC 
TCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTCATGTGGCAAACGTGCGGCTCGTGGTGTGAGTGCCA 
GAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCACCTGCGG 
CCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAGTTT 

Vactcagctttttttattgtaaaaagaaaaatcaactcactgttcagtgctgagaggggatgtaggccca 

TGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGTGTTG 

AAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTO 

GAGAACACTTGAACACACAGAAGGGAGACCCCGCCTA 

CGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTC 

GGTCACAGTAGAGATCCCCGCAACrrCGCTTGTCCTTGGTCACCCTGCATTCCATAGCCATGTGCTTGTCC 

CTGTGCTCCCACGGTTCCC^GGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCCCT 

ACCCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTCCCTCGGTCGCCGTGTGGC 

TCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCCTCACCAGCAGGAGC 

GCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGGGAGCACCCCACCTCTGTGTGTGAT 

GTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACTTTTTTCTT 

TTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTATT 

TGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAAAAAAAAAAAAAAAAAAAAAAAA 

AAA 

Human Arfl protein sequence - varl (public gi: 3 3604 91) (SEQ ID NO: 223) 

MGNI FAJHjFKGIjFGKKEMRI LMVGLDAAGKTTI LYKLKLGE I VTT I PT I GFNVETVE YKNI S FTVWDVGG 
QDKI RPLWRHYFQNTQGL I FVVDSNDRERVNEAREEIM^IiAEDEIjRDAVIiLVFANKQDLPNA I T 

DKLGLHS LRHRNWYIQATCATSGDGLYEGIiDWIiSNQLRNQK 
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TTniaene Name: ARF5 Unigene ID: Hs. 430657 

Human ARF5 mRNA sequence - varl (public gi: 178986) (SEQ ID NO: 340) 

CC^GTTCCAG^ 
C^CGCTCT^ 

™acc^caatcctgtaca^ 
. ^^cagtggaatataagaac^tctgtttcacagtctgggacgtgg 

CTCTOTGGOQQCIAiCTACTTCC^ 

GGTCCAAGAATCTGCTGATGAACTCCAGAAGATGCraCAGGAGGACGAGCTGCGGGATG 

gtatttgccaac^gcaggacatgcccaacgccatgcccgtgagcga^ 

AGCAOTACGOAGCCGCACGTGGTATGTCCAGGCCACCTCTGCCA 

TCTGGACTGGCTGTCCCACGAGCTGTCAAAGCGCTAACCAGCCAGGGGCAGGCCCCTGATGCCCGGAAGC 
TCCTGCGTGCATCCCCGGGATGACCAGACTCCCGGACTCCTCAGGCAGTGCCCTTTCCTCCCACTTTTCC 

TCCCCCATAGCCACAGGCCTCTGCTCCTGCTCCTGCCTGCATGCT 
TTGCTCTCTGGGCACAGAGGGGTCCACTOTCCTGCCTGCTGGGACCTATGGAAGGGG 

GCCCCCTCTTCGAGAGGAGGAGCAGGGATCTGGGTTTC 

GGCCAGGTTGGGAGGGGGAAGGTGAGGGCTTCGGGTGGTGCTATAATGTGGCACTGGATCTTGAGTAATA 
AATTTGCTGTGGTTTG 

Human ARF5 mRNA sequence - var2 (pubHc gi: 21620017) (SEQ ID NO: 341) 

CTCCTCCTGCTGCTGCTGCGCCCCATCCCCCCGCGGCCGGCCAGTTCCAGCCCGCACCCCGCGTCGGTGC 
CCGCGCCCCTCCCCGGGCTCCGCCATGGGCCTCACCGTGTCCGCGCTCTTTTCGCGGATCTTCGGGAAGA 
AGCAGATGCGGATTCT<^TGGTTGGCTTGGATGCGGCTGGC^GACCACAATCCT 

GGGGGAGATTGTCACCACCATCCCAACCATAGGCTTCAATGTAGAAACAGTGGAATATAAGAACATCTG 

TTCACAGTCTGGGACGTGGGAGGCCAGGACAAGATTCGGCCTCTGTGGCGGCACTACTTCCAGAACACTC 

AGGGCCTCATCTTTGTGGTGGACAGTAATGACCGGGAGCGGGTCCAAGAATCTGCTGATGAACTCCAGAA 

'GATGCTGCAGGAGGACGAGCTGCGGGATGCAGTGCTGCTGGTATTTGCCAAC^GCAGGACATGCCCAA^ 

GCCATGCCCGTGAGCGAGCTGACTGAGAAGCTGGGGCTACAGCAC 

AGGCCACCTGTGCCACCCAAGGeACAGGTCTGTACGATGGTCTGGACTGGCTGTCCCACGAGCTGTCAAA 
GCGCTAACCAGCCAGGGGCAGGCCCCTGATGCCCGGAAGCTCCTGCGTGCATCCCCGGGATGACCAGACT 
CCCGGACTCCTCAGGCAGTGCCCTTTCCTCCCACTTTTCCTCCCCCATAGCC^CAGGCCTCTGCTCCTGC 

TCCTGCCTGCATGTTCTCTCTGTTGTTGGAGCCTGGA 

CCTGCCTGCTGGGACCTATGGAAGGGGCTTCCTGGCCAAGGCCCCCTCTTCCAGAGGAGGAGCAGGGATC 
TGGGTTTCCTTTTTTTTTTCTGTTTTGGGTGTACTCTAGGGGCCAGGTTGGGAGGGGGAAGGTGAGGGCT 

TCGGGTGGTGCTATAATGTGGCACTGGATCTTGA 
AAAAAAAAAAAAAAAA 

Human ARF5 mRNA sequence - var3 (public gi: 12804364) (SEQ ID NO: 342) 

CCCGCGTCGGTGCCCGCGCCCCTCCCCGGGCCCCGCCATGGGCCTCACCGTGTCCGCGCTCTTTTCGCGG 
ATCTTCGGGAAGAAGCAGATGCGGATTCTCATGGTTGGCT^ 

ACAAACTGAAGTTGGGGGAGATTGTCACCACCATCCCAACCATAGGCTTCAATGTAGAAACAGTGGAATA 
TAAGAACATCTGTTTOKMTCTGGGA 

TTCCAGAACACTCAGGGCCTCATCTTTGTGGTGGACAGTAATGACCX3 

ATGAACTCCAGAAGATGCTGCAGGAGGACGAGCTGCGGGATGCAGTGCTGCTGGTATTTGCCAAC^GC^ 
GGACATGCCCAACGCCATGCCCGTGAGCGAGCTGACTGACAAGCTGGGGCTACAGCACTTACGCAGCCGC 
ACGTGGTATGTCCAGGCCACCTGTGCCACCCAAGGCACAGGTCTGTACGATGGTCTGGACTGGCTGTCCC 
ACGAGCTGTCAAAGCGCTAACC^GCCAGGGGCAGGCCCCTGATGCCCGGAAGCTCCTGCGTGCATCCCCG 

GATGACCATACTCCCGGACTCCTCAGGCAGTGCC 

CTCTGCTCCTGCTCCTGCCTGCATGTTCTCTCTGTTGTTGGAGCCTGGAGCCTTGCTCTCTGGGCACAGA 
GGGGTCC^CTCTCCTGCCTGCTGGGACCTATGGAAGGGGCTTCCTGGCCAAGGCCCCCTCTTCCAGAGGA 

GGAGCAGGGATCTGGGTTTCCTTTTTTTTCT 

AAGGTGAGGGCTTCGGGTGGTGCTATAATGTGGCACTGGATCTTGAGTAATAAATTTGC^ 
AAAAAAAAAAAAAAAAAAA 

Human ARF5 mRNA sequence - var4 (public gi: 30583012) (SEQ ID NO: 343) 

ATGGGCCTCACCGTGTCCGCGCTCTTC 

GCTTGGATGCGGOTGGCAAGACCACAATCCTGTAGAAACTGAAGTTGGGGGAGATTGTCA^ 

AACCATAGGCTTCAATGTAGAAACAGTGGAATATAA 
CAGGACAAGATTCGGCCTCTGTGGCGGCACTACTTCC^^ 

GTAATGACC^GGAGCGGGTCCAAGAATCTGCTGATGAACTCCAGAAGATGCTGCAGGAGGACGAGCTGCG 
GGATGCAGTGCTGCTGGTATTTGCCAACAAGCAGGACATGCCCAACGCCATGCCCGTGAGCGAGCTGACT 
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Human ARF5 nrotein sequence - varl (public gi: 305830X3) (SEQ JD NO: 224) 

DKLGLQHLRSRTWYVQATCATQGTGLYDGLDWLSHEIiSKR 



Dniqene Name; ATP6V0C Uniqene IDt Hs. 389107 

Human ATP6V0C raRNA sequence - varl (public gi: 33 874373) (SEQ ID NO: 345) 
gSttoagagcV^^^ 

SccgIgtccaagagcggccccgagtatgcttcgtttttcgccgtca^ 
cSSgcgccc^gcgctgcctatggcacagccaagagcggtaccggcattgcggccatot 

cggctcgagcagat^tga^^^ 

5cg?ccScco^ 
agag^SctS 

t^cctgttcctgcgtctgcggagcggcccttgtctcccagctatctataac^ttagctagagt 
ttgtgggttcctgttgctgagacttcctggatggagccgccctcaccgccgggcccgtggccctgcgcgg 

agctgtgtccaataaagttcttggatgtgaaaaaaaaa^ 

Human ATP6V0CmRNA sequence- var2 (pubUcgi: 33 8723 90) (SEQ ro NO: 346) 

GGCTC^CGGGCCGGATCGCCTTCGCCGC^^ 
GCCTCCGCCACCGCCTCGGCCCGCAGAGCTTGCCCCCra^ 

GGCCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCTCGGCCGCCATGGTCTTCAGCGCCCTGG^ 
CTGCCTATGGCACAGCCAAGAGCGGTACCGGCATTGCGGCCATGTCTGTCATGCGGCCGGAGCAGATCAT 
GAAGTCCATCATCCCAGTGGTCATGGCTGGCATCATCGCCATCTACGGCCTGGTGGTGGCAGTCCTCATC 
GCCAACTCCCTGAATGACGACATCAGCCTCTACAAGAGCTTCCTCCAGCTG^ 

GCCTGAGCGGCCTGGCAGCCGGCTTTGCCATCGGCATCGTGGGGGACGCTGGCGTGCGGGGCACCGCCCA 
GCAGCCCCGACTATTCGTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTGCTCGGCCTCTACGGTCTC 
ATCGTCGCCOTCATCCTCTCCACAAAGTAGACCCTCTCCGAGCCCACCAGCC^CAGAATATTATG^G 

A^CACCCCTCCTCATTCCAGAACGAACAGCCTGACACATACGCACGGGGCCGCCGCC 
CTTGTACATGCGCAGTGTCCTAGTGCCCATCGTCTGTTTCCCCGGCCTTGCCCCCGCCCGCCCCGTGCCG 
TGGACATCTGGGCCCACTCATCGCCCCTCCAGGCCCCCGGCGCCCCACCCCCTAGAGTGCTCTGTGTA 
CGGATGATTTAGAATTGTCATTTCTCTTTACTGGATGTTTATTTATAAAGATCTGGCCTG'l-ri-l, l uuu 1 L. 
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TCreGGATGTGAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAA 

Human ATO6V0C mKNA sequence- var3 ^^"Z^1 ) T SS C S^ 7> 

G^CTCATGGCTGGCATC^TCGC^^ 

S^GGCTTTGCCATCGGCATCGTGGC^GACGCTGGCGTGCGGGGCACCGCCCAGC^GCC^ 

gtcggcatgatcctgatt^ 

tctc^a^aa^gtSaccctctccgagcccaccagccacagaatattatgtaaagaccacccctcctcatt 

' GTCC^GTGCCCATCGTCTGTTTCCCCGGCCTTGCCCCCGCCCGCCCCGTGCCGTGGA^ 

gtcaSctctttactggatgtttatttataaagatctggcctgttcctgcgtctgcg^ 

AAAAAAAAAAAAAAAAAAA 




Human ATP6V0CJ mivNA sequence - van> {jmoiic gi: i^i^jo; ^ 

GTTCTGCGGTGCTGGTATTTAGAGCGCAGCGGCTGACGGGCCGGATCGCCTTCGCCGCCGCCCGCCCGCA 
AACCTTCGTGCCCGGCCCGTCCTCGCCCCCGCCTCCGCCACCGCCTCGGCCCGCAGAGCTTGCCCCCTCC 
CCACCCGCAGAC^TGTCCGAGTCCAAGAGCGGCCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCT 
CGGCCGCCATGGTCTTCAGCGCCCTGGGCGCTGCCTATGGCACAGCCAAGAGCGGTACCGGCATTGCGGC 
CATGTCTGTCATGCGGCCGGAGCAGATCATGAAGTCCATCATCCCAGTGGTCATGGCTGGCATCATCGCC 

ATCTACGGCCTGGTGGTGGCAGTCOTCATCGCCAACTC^ 

TCCTCCAGCTGGGCGCCGGCCTGAGCGTGGGCCTGAGCGGCCTGGCAGCCGGCTTTGCCATCGGCATCGT 
GGGGGACGCTGGCGTGCGGGGCACCGCCCAGCAGCCCCGACTATTCGTGGGCATGATCCTGATTCTCATC 

TTCGCCGAGGTGCTCGGCCTCTACX3GTCTG^^ 
AGCCC^CCAGCCACAGAATATTATGTAAAGACCACCCCTC 

CGCACGGGGCCGCCGCCCCCAGTAGTTGGTCTTGTACATGCGCAGTGTCCTAGTGCCCATCGTCTGTTTC 



Human ATP6V0C raRN A sequence - var6 (public gi: 34534447) (SEQ ID NO: 350) 
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TTTATGCTTGTGTTTCTGCAACTGCTTTCTGGCCCCCC^CTCrTTCTGTGGCTGCTGAG 

^^GGTCracrecrGCAGTCTGG^ 

ScScotgctcccctctgtagaaaagggcctggctcacttcctgcctctggtggactactggcct 

^GTCCACTACTTGGGTTGCT^ 

SgSc^atgotgccagggaaa^ 

SgSccc^gS^ 

ggIggcoSot^^ 

agcaccgtcgtcttg^tgcctctgcaggtgct^ 

ctogagcaStctctgtgcctggaggtc^ccgcgcctgtgtcctcacccctgctgagtgct^ 
ggtaggcaagttttgtggcccaaatatatcu^taaaatatgaagaggaatggtaggggtagtcct 

cttcca^ctgac^tatgtagtc^^ 

gtgtgtgtgtgtctgtcagagattcactcttgttgtttgtttgagacggagtctctctgtgtcgcccagg^ 
ctggagtgcagtggcgtgatottgaotcactgcaacctccactcctgggttcaggcgattctcctgcctc 
agcctccctagtaactgggatgacaggcatgcgccaccactcctggctaatttttgtatttttagtagag 
acgaggtttc^cc^tgttacccaggctaatctcgaacttcggatcacctgaggtcaggagttggagacca 
gcctggccaacatggtgaaaccccatctctactaaaaatacaaagaaagttagccaggtctggtggtc^^ 
tgcctgtaatcccacttactcgggaggctgaggcaggagaatcacttgaacccaggaggcagaggttaca 
gtgagccgagatcgcgccactgcactccaccctgggcarcaagagcgaaaactgtctcaaaaaaaaa^ 

Af^AAAATTTTTCATTTGAGGTATTCTTCCAGTAGAAGGTTAGTAAGTTTTTAATGAAACCATTAAAAATT 
ACACTTCCCAGAAAATAGATGACATCAGTGCCCCTTGCTACTTTCTCAGTCCTCACTATTGCTTTGAGGG 
CCCAGGTACTGAAACTGGTTGTCTTGAGTTTTGTGTCAGCTTTTTCTCCAGTCCATTATCCCCCTCCCTT 
GCTTCTGAAGCAGTCTAGGTTAAACTAGCCAGGCAGGTAGTTGTGGACTGGTGATTTTCAAAAGCCCCAC 
TTTAGAGATCAGGCCACAGCTTTTTATATCGCACAGGACACATCAGCCTGAGCTGCTGCCTCATGCCTGT 
TTCCCCAGGAACCTCACTCCTTTGGTAGAACCTTGGGATTTTAGAAATTGTGGCTTTTCCATAACTCATT 
-TACTCCAACAGTTGAAGTTACACACATTGCTCCCAAATTTGGAAATAGACCACAGTACCTTACCTTTCAT 
TCCCCATCTGGCCTTTACCTTCTTTGCTTCAGTGGTTGAAAACAGTTGCCATATTCAAAGTATAGTAGAT 
TTCAACCTCACACAAATGACAAGTCCCATTTTACAATCCTAGGAAGGCCCACCAATTTCATTTCACGCGC 
CAGGGCGGCTGCAGTTGGAGGCCGAGGGCAGCCCTCTGCTCACTGAATGTCTTGCATGTGCTGACTGCTG 
CCCGCAGTGCTGAACATGCCCCACCGCCCAGGCCCAGCACTGCTTGTTGGGTCAG 

Human ATP6V0C mRNA sequence - var7 (public gi: 30583148) (SEQIDNO: 351) 

ATGTCCGAGTCCAAGAGCGGCCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCTCGGCCGCCATGG 
TCTTCAGCGCCCTGGGCGCTGCCTATGGCACAGCCAAGAGCGGTACCGGCATTGCGGCCATGTCTGTCAT 
GCGGCCGGAGCAGATCATGAAGTCCATCATCCCAGTGGTCATGGCTGGCATCATCGCCATCTACGGCCTG 
GTGGTGGCAGTCCTCATCGCCAACTCCCTGAATGACGACATCAGCCTCTACAAGAGCTTCCTCCAGCTGG 
GCGCCGGCCTGAGCGTGGGCCTGAGCGGCCTGGCAGCCGGCTTTGCCATCGGCATCGTGGGGGACGCTGG 
CGTGCGGGGCACCGCCCAGCAGCCCCGACTATTCGTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTG 
CTCGGCCTCTACGGTCTCATCGTCGCCCTCATCCTCTCCACAAAGTAG 

Human ATP6V0C protein sequence - varl (public gi: 3 0583149) (SEQIDNO: 225) 

MSESKSGPBYASFFAVMGASAAMVFSALGAAYGTAKSGTGIAAMSVMRPEQIMKSIIPVVMAGIIAIYGL 
— w . rt ni.^iir!r.m;m.cnr.» nrjRBTfiTVRDAnVRGTAOOPRIiFVGMIIiIIjIFAEV 



MSESKSGPBYASFFAVMGASAAMVFSALGAAYGTAKSGTGIAAMSVPWFKUJ.mi^J.x^vvr.^xx^.^^ 
WAVLIANSLITODISIiYKSFLQLGAGLSVGIiSGIiAAGFAIGIVGDAGVRGTAQQPRIjFVGMIIiIIjIFAEV 

LGLYGLIVALIIjSTK' 

Human ATP6V0C protein sequence - var2 (public gi: 34534448) (SEQ ID NO: 226) 

MILPAALC^GPGSLPASGGLIJ^QGPLLGLLSSLYSVSCQRVCHALVSCAYMMQI.DWW3LQWEPPKMH 
CNCSICLPGKJCCTVTRSSGQAI^QALGPSLQVHWVLAWHVWAPPGWRGGGRVAPWPMQPPSSLYSHSIJ} 

TQHRRLGCLCRCYPEPLSYCLVFL 

Human ATP6V0C pray sequence - varl (SEQ ID NO: 352) m „r,^r. 

CCGCCATGGAGTACCCATACGACGTACCAGATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCACCC 
AAGCAGTGGTATCAACGCAGAGTGGCCATrTGGGGGGTCTGCGGTGCTGGTATTTAGAGCGCAGCGGCTG 
ACGGGCCGGATCGCCTTCGCCGCCGCCCGCCCGCAAACCTTCGTGCCCGGCCCGTCCTCGCCCCCGCCTC 
CGCCACCGCCTCGGCCCGCAGAGCTTGCCCCCTCCCCCCCATGTCGGCCGCCTCGGCCTCTAGAGGGTGG 
GCATCGATACGGGATCCATCGAGCTCGAGCTGCAGATGAATCGTAGATACTGAAAAACCCCGCAAGTTCA 

CTTCAACTGTGCATTCGTGCA 

Unigene Name : CBLB Pnigene ID : Hs.3144 Clone ID : 3GD_114 
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Human CBL-B mRNA sequence - varl (public gi: 4757919) (SEQ ID NO: 353) 

CTGGGTCCTGTGTGTGCCAGBX3GGGTGGGGTGTCCAGCG 

GGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAATTGT 
TGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGTTCTCAGCGAT 
CCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTC 

TGTCTCTGGAeAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAATGGCAGAAACC 
"CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATG 
TGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTG 
GTAAGACTGTGCCAAAATCCCAAACTTCAGTTGAAAAATAG 
ATACATATCAGCATTTACGACTTATATTGAGTAAATATGA^ 

GAATGAGTACTTTAAAATCTACATTGATAGCCTTATGAAAAAGTCAAAACGGGCAATAAGACTCTTTA2\A ^ 

GAAGGC^GGAGAGAATGTATGAAGAACAGT(^CAGGACAGACGAAATCTCACAAAACTGTCCCTTATCT 

TCAGTC^CATGCTGGCAGAAATCAAAGCAATCTTTCCCAATGGTCAATTCC^GGGAGATAACTTTCGTAT 

CACT^AAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAGACAAAACTATCGTACCATGGAAAGTA 

TTCAGACAGTGCCTTCATGAGGTCCACGAGATTAGCTOTAGCCTGGAAGCAATGGCTCTAAAATCT^ACAA 

TTGATTTAACTTGCAATGATTAC^TTT 

GGGCTCTATTTTGCGGAATTGGAATTTCTTAGCTGTGACACATCCAGGTTACATGGCATTTCTCACATAT 

GATGAAGTTAAAGGACGACTACAGAAATATAGCACCAAACCCGGAAGCTATATTTTCCGGTTAAGTTGCA 

CTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTACAGACCATACCTCATAA 

CAAGCCCTTATTTCAAGCCCTGATTGATGGCAGCAGGGAAGGATTTTATCTTTATCCTGATGGGAGGAGT 

TATAATCCTGATTTAACTGGATTATGTGAACCTACACCTCATGACCATATAAAAGTTACACAGGAACAAT 

ATGAATTATATTGTGAAATGGGCTCCACTTTTCAGCTCTGTAAGATTTGTGCAGAGAATGACAAAGATGT 

CAAGATTGAGCCTTGTGGGCATTTGATGTGCACCTCTTGCCTTACGGCATGGCAGGAGTCGGATGGTCAG 

GGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCATAATCGTGGACCCCTTTGATCCAAGAG 

ATGAAGGCTCCAGGTGTTGCAGCATCATTGACCCCTTTGGCATGCCGATGCTAGACTTGGACGACGATGA 

TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAACGTCCGAAAGTGCACTGACAGGCAGAACTCA 

CCAGTCACATCACCAGGATCCTCTCCCCTTGCCCAGAGAAGAAAGCCACAGCCTGACCCACTCCAGATCC 

CACATCTAAGCCTGCCACCCGTGCCTCCTC^CCTGGATCTAATTCAGAAAGGCATA 

TGGCAGCCCAACAGGTTCACCAAAGTCTTCrCCTTGCATGGTGAGAAAACAAGATAAACCAC 

CCACCTCCTCCCTTAAGAGATCCTCCTCCACCGCCACCT 

GACTGAGTAGACACATCCATCATGTGGAAAGCGTGCCTTCCAGAGACCCGCCAATGCCTCTTGAAGCATG 
GTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGGAGGGCTCTCCAAAA 
CCTGGAATCACAGCGAGTTCAAATGTCAATGGAAGGCACAGTAGAGTGGGCTCTGACCCAGTGCTTATGC 
GGAAAGACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTTTTCCAATGGTCACCTTGGAAGTGA 
AGAATATGATGTTCCTCCCCGGCTTTCTCCTCCTCCTCCAGTTACCACCCTCCTCCCTAGCATAAAGTGT 
ACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAAC^GAGACCCAGTAGAGGAAGATGATGATGAATACA 
AGATTCCTTCATCCCACCCTGTTTCCCTGAATTCACAACCATCTCATTGTCATAATGTAAAACCTCCTGT 
TCGGTCCTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCATCTTCAGAGAAGAAATCAAAC 
ATCCCTGACTTAAGCATATATTTAAAGGGTACGTATAGAATATAATTTCCTTTGTGATGTACATCTTAAT 
GGTCAGAATTTAAAGGCAAAATTTCATGCCATTGTACTGAAAATACATTAAGGTTTTGTGTTATCCTCTA 
GGAGATGTTTTTGATTCAGCCTCTGATCCCGTGCCATTAC 

CAAAGCATGGTTCTTCACTCAAGAGGACGCCCTCTGATTATGATCTTCTCATCCCTCCATTAGGTTGA^ 
CCTTTAAAAAAGTTTTGAACAACCCACCCCTCCTTCTTTTAATTTCAGAATTTTCAGAATTCAGAGTTCA 

GTATAAC^CAGACTGACTGGGTTGTGAATTTGCCT 

CCAAGTTCACGAGACCATTACTCCATGTAGATGATTAAGGTAGTAGTGTAGTAGTTGGGCATCAGTCAGG 
TTTTAAGCAAGTTGTTTTGTCC^TACTA 

TTTGAAGTGATGACnTGAAGTGTTGAGATTTTCTTTAAGTATAATAATTCTTAATAAATATGAACTTGCT 

TTTCTTGCAGCATGAGCACCAGTTCCACTTACGCTAATTAAATTATGCAAAA 

AGAACTGATAATAAATTCTGTTTTATTCTAATCAT^ 

Human CBL-B mRNA sequence - var2 (public gi: 23273908) (SEQ ID NO: 354) 

AGCGGAGTGCTGCTGCGGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTG 
TCTTGACGGTAATTGTTGCGTTTCCACGTCTCGGAGGCCTC 

AGCTGTTCTCAGCGATCCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTGC^ 

GACCCGCGCACACGCGTGTCTCTGGACAGCTACGGCGCCGAAAGAACTAA7UVTTCCAGATGGCAAACTCA 

ATGAATGGCAGAAACCCTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATG 

CTATTCAGGATGCAGTTGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAGAAGACTTGGAA 

GCTCATGGACAAAGTGGTAAGACTGTGCCAAAATCCCAAAC^T 

CTTGATATTTTGCCTGATACATATCAGCATTTACGACTT^ 

TTGCCCAACTCAGTGAGAATGAGTACTTTAA7VATCTACATTGATAGCCTTATGAAAAAGTCAAAACGGGC 
AATAAGACTOTTTAAAGAAGGCAAGGAGAGAATGTATGAAGAACAGTCACAGGACAGACGAAATCTCACA 
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^mataactScgtatcacaaaagcagatgctgctgaa 

^TACC^TCGA^ 
GCTCTAAi^T 

GGCTGTTTCAGCCTTGGGGCTCTATTTTGCGGAAT^ 
GGCATTTCT<^CATATGATGAAGTT^^ 

TOCCGGTTAAGTTGCACTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTAC 
AGACCATACCTCATAACAAGCCCTTATTTCAAGCCCTGATTGATGGCAGCAGGGAAGGATTTTATCTTTA 
TCCTGATGGGAGGAGTTATAATCCTGATTTAACTGGATTATGTGAACCTACACCTCATGACCATAT 
GTTACACAGGAACAATATGAATTATATTGTGAAATGGGCTCCACTTTT C AGCTCTGTAAGATTTGTGCAG 
AGAATGACAAAGATGTC^GATTGAGCCTTC^ 

GGAGTCGGATGGTCAGGGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCATAATCGTGGAT 
CCCTTTGATCCAAGAGATGAAGGCTCCAGGTGTTGCAGCATCATTGACCCCTTTGGCATGCCGATGCTCG 
ACTTGGACGACGATGATGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAACGTCCGAAAGTGCAC 

TQAC3U3GCAGAACTCACCAGTCACATCM 

GACCCACTCCAGATCCCACATCTAAGCCTGCCACCCGTGCCTCCTCGCCTGGATCTAATTCAGAAAGGCA 
TAGTTAGATCTCCCTGTGGCAGCCCAACGGGTTGACCAAAGTCTTCTCCTTGCATGGTGAGAAAACAAGA 
TAAACCACTCCCAGCACCACCTCOTCCCTTAAGAGATCCTCCTCCACCGCCACCTGAAAGACCTCCACCA 
ATCCCACCAGACAATAGACTGAGTAGACACATCCATCATGTGGAAAGCGTGCCTTCCAAAGACCCGCCAA 
TGCCTCTTGAAGC^TGGTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGG 
GGAGGGCTCTCCAAAACCTGGAATCACAGCGAGTTCAAATGTCAATGGAAGGCACAGTAGAGTGGGCTCT 
GACCCAGTGCTTATGCGGAAACACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTTTTCCAATG 
GTCACCTTGGAAGTGAAGAATATGATGTTCCTCCCCGGCTTTCTCCTCCTCCTCCAGTTACCACCCTCCT 
CCCTAGCATAAAGTGTACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGAGACCCAGTAGAGGAA 
GATGATGATGAATACAAGATTCCTTCATCCCACCCTGTTTCCCTGAATTCACAACCATCTCATTGTCATA 
ATGTAAAACCTCCTGTTCGGTCTTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCATCTTC 
AGAGAAGAAATCTVAACATCCCTGACTTAAGCATATATTTAAAGGGAGATGTTTTTGATTCAGCCTCTGAT 

CCCGTGCCATTACCACCTGCC^GGCCTCCAACTCGG 
CGCCCTCTGATTATGATCTTCTCATCCCTCCATTAGGTGAAGA 

CCCACCTCCCCCACCTCCTGCAAGGCATAGTCTC^TTGAACATTCAAAACCTCCTGGCTCCAGTAGCCGG 
CCATCCTCAGGACAGGATCTTTTTCTTCTTCCTTGAGATCCCTTTGTTGATCTAGCAAGTGGCCAAGTTC 
CTTTGCCTCCCGCTAGAAGGTTACCAGGTGAAAATGTCAAAACTAAC^GAACATCACAGGACTATGATCA 
GCTTCCTTGATGTTCAGATGGTTCACAGGCACCAGCCAGACCCCCTAAACCACGACCGCGCAGGACTGCA 
CCAGAAATTCACCACAGAAAACCCCATGGGCCTGAGGCGGCATTGGAAAATGTCGATGCAAAAATTGCAA 
AACTCATGGGAGAGGGTTATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGTCGA 
AGTTGCCCGGAGCATCCTCCGAGAATTTGCCTTCCCTCCTCCAGTATCCCCACGTCTAAATCTATAGCAG 
CCAGAACTGTAGACACCAAAATGGAAAGCZAATCGATGTATTCCAAGAGTGTGGAAATAAAGAGAACTGAG 
ATGGAATTCAAGAGAGAAGTGTCTCCTCCTCGTGTAGCAGCTTGAGAAGAGGCTTGGGAGTGCAGCTTCT 

CAAAGGAGACCGATGCTTGCTCAGGATGTCXSACAGCTGTGGCT^ 

AATCAGGGTTGAACTGACAAAAATAATTTAAAGACGTTTACTTCCCTTGAACTTTGAACCTGTGAAATGC 
TTTACCTTGTTTACAATTTGGCAAAGTTGCAGTTTGTTCTTGTTTTTAGTTTAGTTTTGTTTTGGTGTTT 
TGATACCTGTACTGTGTTCTTCAC^GACCCTTTGTAGCGTGGTCAGGTCTGCTGTAACATTTCCCACCAA 
CTCTCTTGCTGTCC^CATCAACAGCTAAATCATTTATTCATATGGATCTCTACCATCCCCATGCCTTGCC 
CAGGTCCAGTTCCATTTCTCTCATTCACAAGATGCTTTG 

GCAAAAAAAAAAGTATGTATTCTTmCTACTGAGTTTCTTCTTTGGAAACCATCACTATTG 
AAAAACCTGAATGTATAAAGCATTTATTTGTCAATAAACTGCCTTTTGTAAGGGGTTTTCACAAAA 

AAAAAAAA 

Human CBL-B mRNA sequence - var3 (public gi: 862406) (SEQ ID NO: 355) 

CTGGGTCCTGTGTGTGCCACAGGGGTGGGGTGTCCAGCGAGCGGTCTCCTCCTCCTGCTAGTGCTGCTGC 
GGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAATTGT 
TGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGTTCTCAGCGAT 

CCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTGCGT 

TGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAATGGCAGAAACC 
CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATGCTATTCAGGATGCAGT 
TGGACCCCCTAAGCAAGCTGCrcCAGATCGCA^ 

GTAAGACTGTGCCAAAATCCCAAACTTCAGTTGAAAAATAGCCCACCATATATACTTGATATTTTG 

ATACATATCAGCATTTACGACTTATATTGAGTAAATATGATGACAACCAGAAACTTGC 

GAATGAGTACTTTAAAATCTACATTGATAGCCTTATGAAAAAGTCAAAACGGGCAATAAGACTCTTTAAA 

GAAGGCAAGGAGAGAATGTATGAAGAACAGTCACAGGACAGACGAAATCTCACAAAACTO 

TCAGTCACATGCTGGCAGAAATGAAAGCAATCTTTCCCAATGGTCAATTCCAGGGAGATAA 

C^CAAAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAGACAA 

TTCAGACAGTGCCTTCATGAGGTCCACCAGATTAGCT 
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TTGATTTAACTTGCAATGATTACATTTCAGTTTTTGAAT^ 

GGGCTCTATTTTGCGGAATTGGAATTTCTTAGCTGTGACACATCCAGGTTACATGGCATTTCTCACATA^ 
GATGAAGTTAAAGCACGACTACAGAAATATAGCACCAAACCCGGAAGCTATATTTTCCGGTTAAGTTGC^ 
CTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTAGAGACCATACCTCATAA 
CAAGCCCTTATTTCAAGCCCTGATTGATGGC^GCAGGGAAGGATTTTATCTTTATCCTGATGGGAGGAGT 
TATAATCCTGATTTAACTGGATTATGTGAACCTACACCTCATGACCATA^ 

ATGAATTATATTGTGAAATGGGCTCCACTTTTCAGCTCTGTAAGATTTGTGCAGAGAATGACAAAGATG^ 
CAAGATTGAGCCTTGTGGGCATTTGATGTG 

GGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCATAATCGTGGACCCCTTTGATCCAAGAG 
ATGAAGGCTCCAGGTGTTGCAGGATCATTGACCCCTTTGGCATGCCGATGCTAGACT^GGACGACGATGA 
TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAACGTCCGAAAGTGCACTGACIAGGCAGAACTCA 
C^GT^CATCACCAGGATCCTCTCCCCTTGCCCAGAGAAGAAAGCCACAGCCTGACCCACTCCAGATCC 

C^CATCTAAGCCrGCCACCCGTGCCTCCTCGC^^ 

TGGCAGCCCAACAGGTTCACCAAAGTCTTCTCCTTGCATGGTGAGAAAACAAGATAA 

CCACCTCCTCCCTTAAGAGATCCTCCTCC^C^^ 
GACTGAGTAGAGACATCCATGATGTGGAAAGCGTGCC 

GTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGGAGGGCTCTCCAAAA 

CCTGGAATCACAGCGAGTTCAAATGTCAATGGAAGGCACAGTAGAGTGGGCTCTGACCCAGTGCTTATGC 

GGAAAGACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTTTTCGAATGGTC^ 

AGAATATGATGTTCCTCCCCX5GCTTTCTCCTCCTCCTCCAGTTACCACCCTCCTCCCTAGCATAAAGTGT 

ACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGAGACCCAGTAGAGGAAGATGATGATGAATACA 

AGATTCCTTCATCCCACCCTGTTTCCCTGAATTCACAACCATCTCATTGTCATAATGT 

TCX3GTCCTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCATCTTCAGAGAAGAAATCAAAC 

ATCCCTGACTTAAGCATATATTTAAAGGGAGATGTTTTTGATT 

CTGCCAGGCCTCCAACTCGGGACAATCCAAAGCATGGTTCTTCACTCAACAGGACGCCCTCTGATTATGA 
TCTTCTCATCCCTCCATTAGGTGAAGATGCTTTTGATGCCCTCCCTCCATCTCTCCCACCTCCCCCACCT 
CCTGCAAGGCATAGTCTCATTGAACATTCAAAACCTCCTGGCTCCAGTAGCCGGCCATCCTCAGGACAGG 
.ATCTTTTTCTTCTTCCTTCAGATCCCTTTC 

AAGGTTACC^GGTGAAAATGTCAAAACTAACAGAACATCACAGGACTATGATC^GCTTCCTTCATGTTCA 
GATGGTTCAGAGGCACCAGCCAGACCCCCTAAACCACGACC^ 

GAAAACCCCATGGGCCTGAGGCGGCATTGGAAAATGTCGATGCAAAAATTGCAAAACTCATGGGAGAGGG 
TTATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGTCGAAGTTGCCCGGAGCATC 
CTCCGAGAATTTGCCTTCCCTCCTCCAGTATCCCCACGTCTAAATCTATAGCAGCGAGAACTGTAG 
CAAAATGGAAAGCAATOSATGTATTCCAAG 

AAGTGTCTCCTCCTCGTGTAGCAGCTTGAGAAGAGGCTTGGGAGTGCAGCTTCTCZAAAGGAGACCGATGC 

TTGCTCAGGATGTCGACAGCTGTGGCTTCCTTGTTTTTGCTAGCCATATTTTTAAATCAGGGTTGAACTG 

ACAAAAATAATTTAAAGACGTTTACTTCCCTTGAACTTTGAACCTGTGAAATGCTTTACCTTGTTTACAA 

TTTGGCAAAGTTGCAGTTTGTTCTTGTTTTTAGTTTAGTTTTGTTTTGGTGTTTTGATACCTGTACTGTG 

TTCTTCACAGACCCTTTGTAGCGTGGTCAGGTCTGCTGTAACATTTCCC^CCAACTCTCTTGCTGTCCAC 

ATGAACAGCTAAATCATTTATTGATATGGATCTCT^ 

TCTCTGATTGACAAGATGCTTTGAAGGTTCT 

TGTATTCTTCACTACTGAGTTTCTTCTTTGGAAACCATCACTATTGAGAGATGGGAAAAACCTGAATGT^ 
TAAAGC^TTTATTTGTCAATAAACTGCCTTTTGTAAGGGGTTTTCACATAAAAAAAAAAAAA 

Human CBL-B mRNA sequence - var4 (public gi: 862408) (SEQ ID NO: 356) 

CTGGGTCCTGTGTGTGCCACAGGGGTGGGGTGTC 

GGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAATTGT 
TGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGTTCTCAGCGAT 
CCCACTCC^GCCGGGGCTCCCCACACAGACTGGGCTGCGTGCGTG 

TGTCTCTGGACAG CTACGGCGCCGAAAGAACTAAAATTCCAGATGG CAAACT CAATGAATGGCAGAAACC 
CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATGCTATTCAGGATGCAGT 
TGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAGAAGACTTGGAAGCTCATGGAGAAAGTG 
GTAAGACTGTGCCAAAATCCCAAACTTCAGTTGAAAAAT^ 

ATAC^TATGAGCATTTACGACTTATATTGAGTAAATATGATGAjCAACCAGAAACTTGCCCAAC 

gaatgagtactttaaaatctacattgatagccttatgaaaaagtcaaaacgggcaataagactctttaaa 

gaaggcaaggagagaatgtatgaagaacagtc^caggacagacgaaatctc^ 

tcagtcacatgctggcagaaatcaaagcaatct^ 

cacaaaagcagatgctgctgaattctggagaaagttttttggagacaaaactatcgtaccatgga 

ttcagacagtgccttc^tgaggtccaccagattagctcta 

ttgatttaacttgcaatgattacatttgagtttttgaatttgatattt 

gggctctattttgcggaattggaatttcttagctgtgacagatccaggttacatggcatt 

gatgaagttaaagc^cgactacagaaatatagcaccaaacccggaagctatattttccggttaag 

ctcgattgggacagtgggccattggctatgtgactggggatgggaatatcttacagaccatacctcataa 
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tScctctgataatggtc^^ 

atccctScttaagcmatatotaaagggagatgtttttgattcagcctct 
SgccIggcS^tcgggacaatccaaagcatggttcttcactc 

tStScat^ 

TTCAGAATTTTCAGAATTCAGAGTTCAGTATAACACAGACTCACTGGGTTGTGAATTTGCCTGAAMreTG 

aItC^TCT^cSgCCGG^^ 
TAGTGTAGTAGTTGGGCATC^GTC^ 

CACATGAGAGCTCTGTGCTCTAGTAGTTTTGAAGTGATC 

ATAATTCTTAATAAATATGAACTTGCTTTTCTTGCAGCATGAGCACCAGTTCCACTTACGCTAATTA 

TATGCAAAATTAAATAGTTGTATGTAGAGAACTGATAATAAATTCTGTTTTATTCTAATC^ 

TAACACATTCAAAAAAAAAAA 

Human CBL-B raKNA sequence - var5 (public gi: 862410) (SEQ ID NO: 357) 

CTGGGXCCTGTGTGTGCCACAGGGGTGGGGTGTC^^ 

GGCGTCCCGCGGCCTCCC03AGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAA^GT 
TCCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGT 
CCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTGCGTGCGTGTGGAGTGGGACCCGCGCA 
TGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAATGGCAGAAACC 

CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATG 
TGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAGAAGACTTGGAAGCTCATGGACA^ 

GTAAGACTGTGCCAAAATCCCAAACTTCAGT^^ 

ATA(^TATCAGC^TTTACGACTTATATTGAGTAAATATGATGACAACCAGAAACTTGCCCAACTCAGTGA 

GAATGAGTACTTTAAAATCTACATTGATAGCCTTATGAAAAAGTCAAAACGGGCAATAAGACTCTTTAAA 

GAAGGCAAGGAGAGAATGTATGAAGAACAGTCACAGGACAGACGAAATCTCACAAAACTGTCCCTTATCT 

TCAGTCACATGCTGGCAGAAATCAAAGCAATCTTTCCCAATGGTCAATTCCAGGGAGATAACTTTC^ 

CACAAAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAGACyUiAACTATCGTACCATGGAAAGTA 

TTCAGACAGTGCCTTCATGAGGTCCACCAGATTAGCTCTAGCCTGGAAGCAATGGCTCTAAAATCAACAA 

TTGATTTAACTTGCAATGATTACATTTCAGTTTTTGAATTTGATATTTTTACCAGGCTGTTTCAGCCT 

GGGCTOTATTTTGCGGAATTGGAATTTCTTAGCTGTGACACATCCAGGTTACATGGCATTTCTCACATAT 

GATGAAGTTAAAGCACGACTACAGAAATATAGCACCAAACCCGGAAGCTATATTTTCCGGTTAAGTTGCA 

CTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTACAGACCATACCTCATAA 

C3\AGCCCTTATTTCAAGCCCTGATTGATGGCAGCAGGGAAGGATTTTATCTTTATCCTGATGGGAGGAGT 

TATARTCCTGATTTAACTGGATTATGTGAACCTACACCTCATGACCATATAAAAGTTACACAGGAACAAT 

ATGAATTATATTGTGAAATGGGCTCCACTTTTCAGCTCTGTAAGATTTGTGCAGAGAATGACAAAGATGT 

CAAGATTGAGCCTTGTGGGCATTTGATGTGCACCTCTTGCCTTACGGCATGGCAGGAGTCGGATGGTCAG 

GGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCATAATCGTGGACCCCTTTGATCCAAGAG 

ATGAAGGCTCCAGGTGTTGCAGCATCATTGACCCCTTTGGCATGCCGATGCTAGACTTGGACGACGATGA 

TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCy^CGTCCGAAAGTG(^CTGACAGGCAGAACTCA 

CC^GTCAC^TCyvCCAGGATCCTCTCCCCTTGCCCAGAGAAGAAAGCCACAGCCTGACCCACTCCAGATCC 

CACATCTAAGCCTGCCACCXGTGCCTCCTCX3CCTGGATCTAATTCAGAAAGGCATAGTTAGATCTCCCTG 

TGGCAGCCCAACAGGTTCACCAAAGTCTTCTCCTTGCATGGTGAGAAAACAAGATAAACC^CTCCCAGCA 

CCACCTCCTCCCTTAAGAGATCCTCCTCCACCGCCACCTGAAAGACCTCCACCAATCCCACCAGACAATA 

GACTGAGTAGAC^CATCCaTCATGTGGAAAGCGTGCCTTCCAGAGACCCGCCAATGCCTCTTGAAGCATG 

GTGCCCT03GGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGGAGGGCTCTCCAAAA 

CCTGGAATCAC^GCGAGTTCAAATGTCAATGGAAGGCACAGTAGAGTGGGCTCTGACCCAGTGCTTATGC 



Figure 36 part - 15 



p c TV U S O O B 3 01 8 



GGAAACACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTT^ 
AGAATAX^^ 

ACTGGTCCGTT AGCAAATT CTCTTTCAGAGAAAACAAGAGACC CAGTAGAGGAAGATGATGATGAATACA 

AGATTCCTTCATCCCACCCTGTTTCCCTGAATTCACAACCATCTCATTO 

TCGGTCCTGTGATAATGGTCACTG^^ 

ATCCCTGACTTAAGCATATATTTAAAGGGTACGTATAGAATATAATTTCCTTTGTGATGTACATCTTAAT 
GGTCAGAATTTAAAGGCAAAATTTCATGCCATTC 

GGAGATGTTTTTGATTCAGCCTCTGATCCCXaTGCCATTACCACCTGCCAGGCCTCCAACT 

CAAAGCATGGTTCTTCACTCAAC^GGACGCCCTCTGATTA^ 

CCTTTAAAAAAGTTTTGAACAACCCACCCCTCCTTCTTTTAATTO 

GTATAACACAGACTCAOTGGGTTGTGAATTTGCCTGAAATTTGAATGGGTTCTCC^ 

CCAAGTTCACGAGACCATTACTCCATGTAGATGATrAAGGTAGTAGTGTAGTAGTTGGGCATCAGTCAGG 
TTTTAAGCAAGTTGTTTTGTCCATACTAAATGTAGTCT^ 

TTTGAAGTGATGACTTGAAGTGTTGAGATTTTCTTTAAGTATAATAATTCTTAATAAATATGAACTTGCT 
TTTCTTGCAGC^TGAGCACCAGTTCCACTTACG 

AGAACTGATAATAAATTCTGTTTTATTCTAATCATTACAACTGTAACACATTCAAA 

Human CBL-B mRNA sequence - var6 (public gi: 21753192) (SEQ ID NO: 358) 

AGTGCTGCTGCGGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTG 

ACGGTAATTGTTGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTG 

TTCTCAGCGATCCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTGCGTGCGTGTGGAGTGGGACCC 

GCGCACACGCGTGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAA 

TGGCAGAAACCCTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATGCTATT 

CAGGATGCAGTTGGACCCCCTAAGCAAGOTGCCGCAGATCGCAAAACCTGGAATCACAGCGAGTTCAAAT 

GTCAATGGAAGGGACAGTAGAGTGGGCTCTGACCCAGTGCTTATGCGGAAACAC^GACGCGATGATTTGC 

CTTTAGAAGGAGCTAAGGTCTTTTCCAATGGTCACCTTGGAAGTGAAGAATATGATGTTCCTCCCCGGCT 

TTCTCCTCCTCCTCCAGTTACC^CCCTCCTCCCTAGC^TAAAGTGTACTGGTCCGTTAGCAAATTCTCTT 

TCAGAGAAAACAAGAGACCCAGTAGAGGAAGATGATGATGAATACAAGATTCCTTCATCCCACCCTGTTT 

CCCTGAATTCACAACCATCTCATTGTCATAATGTAAAACCTCCTGTTCGGTCTTGTGATAATGGTCACT^ 

TATGCTGAATGGAACACATGGTCCATCTTCAGAGAAGAAATCAAACATCCCTGACT 

AAGGGAGATGTTTTTGATTCAGCCTCTGATCCCGTGCC^ 

ATCCAAAGCATGGTTCTTCACTCAACAGGACGCCCTCT 

AGATGCTTTTGATGCCCTCCCTCCATCTCTCC^^ 

CATTCAAAACCTCCTGGCTCCAGTAGCCGGCCATCCTCAGGACAGGATCTTTTTCTTCTTCCTTCAGATC 

CCTTTGTTGATCTAGCAAGTGGCCAAGTTCCTTTGCCTCCTGCTAGAAGGTTACCAGX3TG 

AACTAACAGAACATCAC^GGACTATGATCAGCTTCCTTCATGTTCAGATGGTTCACAGGCATCAGCCAGA 

CCCCCTAAACCACGACCGCGCAGGACTGCACCAGi^ 

CATTGGAAAATGTCGATGCAAAAATTGCAAAACTCATGGGAGAGGGTTATGCCTTTGAAGAGGTGAAGAG 
AGCCTTAGAGATAGCCCAGAATAATGTCGAAGTTGCCCGGAGCATCCTCCGAGAATTTGCCTTCCCTCCT 
CCAGTATCCCCACGTCTAAATCTATAGCAGCGAGAACTGTAGAC^ 

TCCAAGAGTGTGGAAATAAAGAGAACTGAGATGGAATTC^GAGAGAAGTGTCTCCTCCTCGTGTAGCAG 
CTTGAGAAGAGGCTTGGGAGTGCAGCTTCTCAAAGGAGACCGATGCTTGCTCAGGATGTCGACAGCTGTG 

GCTTCCTTGTTTTTGCTAGCCATATTTTTAAATC^ 

CTTCCCTTGAACTTTGAACCK3TGAAATGCTTTACCTTO 

TGTTTTTAGTTTAGTTTTGTTTTGGTGTTTTGTACCTGTACTGTGTTC 

GTCAGGTCTGCTGTAACATTTCCCACCAACTCTCTTGCTGTCCACATCAACAGCTAAATCATTTATT 

ATGGATCTCTACCATCCCCATGCCTTGCCCAGGTCCAGTTC 

GGTTCTGATTTTCAACTGATCAAACTAATGCA 

Human Cbl-b mRNA sequence - var 7 (SEQ ID NO: 359) 

CGTNTTTGGNANNCACTACAGGGGATGTTTAATACACACT 

G 

TAAGATACCCCACTCAAACCCATAAAAAAGAGCATCOT 
A 

CCCATACGACGTACCAGATTACGCTCATATGGCC^ 

GGACTCTGACCCANTGCTTATGCGGAAACACAGACGCCATGATTTGCCTTTAGAAGGAGOT 
C 

ACCTTGGAAGTGAAGAATATGATGTTCCTCCCCG^ 
G 

TGTACTGGTCCGTTAGCAAATTCTOTTTCAGAGAAAAGAAGAGACCCAGTAGAGGAAGATGATGATC 
C 
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TTCATCCCACCCTGTTTCC^^ 

GTCACTGTATGCTGAAT«3AACA^ 

^TGAAGATGCTTTTGATGCCCTCCCTCCATCTCTrc^ 

Laacctcctggctccagtagccggccatcctc 

CAAGTGGCCAAGTTCCTTT6CCTCC06CTAGAAGGTTACCAGQTQAAAATGTCAAAACTAA 

Ltcagcttccttcatgttcagatggttcacaggcaccagccagaccccctaaaccacgaccgcgcaggactgcaccag 
aattcaccacagaaaaccccatgggcctgaggcggcattggaaartgtcgatgcaa^ 

STTATGCCrTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGTCGAAGTTGC 
^TTGCCTTCCCTCCTCC^GTATCCC^^ 

TCTATTCCAAGAGTGTGGAAATAAAGAGAACTGAGATGGAATTCAAGAGAGAAGTGTCTCCTCCTCGTGTAGCAGCTTG 

Saagaggcttgggagtgcagcttctcaaagaaaaccgatgcttgctcaggatgtcnac^gctgnggnctnccttgtttt 

GCTAGCCATTTTTTTAAATNAGGGTTGAACTNGANAAAANTATTTAAAAACGTTTACCTCCCTTGAACTTTGAACCTGG 
G 

AAAGNC 

Human Cbl-b Protein sequence - var 7 (SEQ ID NO: 361) „„„ T , T 
MRKHRRHDLPLEGAKVSSNGHI^SEEYDVPP^ 

g 

LNSQPSHCHNVKPPVRSCI5NGHCMIjNGTHGPSSEKKSNIPDLSIYLKGEDAFDAIjPP 

SRPSSGQDLFLIiPSDPFVDLASGQVPLPPARRIiPGENVKTNRTSQDYDQLPSCSDGSQ 
P 

HGPEAALENVDAKI AKLMGEGYAFEEVKR^E I AQNNVEVARS I I»RE FAFPPPVS PRLNL 

Human cbl-B clone3Gdll4 (partial sequence) (SEQ ID NO: 360) 

ACTCTGACCCAGTGCTTATGCGGAAACACAGACGCCATGATTTGCCTTTA 

GAAGGAGCTAAGGTCTCTTCCAATGGTCACCTTGGAAGTGAAGAATATGA 

TGTTCCTCCCCGGCTTTCTCCTCCTCCTCCAGTTACCACCCTCCTCCCTA 

GCATAAAGTGTACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGA 

GACCCAGTAGAGGAAGATGATGATGAATACAAGATTCCTTCATCCCACCC 

TGTTTCCCTGAATTCACAACCATCTCATTGTCATAATGTAAAACCTCCTG 

TTCGGTCTTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCA 

TCTTCAGAGAAGAAATCAAACATCCCTGACTTAAGCATATATTTAAAGGG 

TGAAGATGCTTTTGATGCCCTCCCTCCATCTCTCCCACCTCCCCCACCTC 

CTGCAAGGCATAGTCTCATTGAACATTCAAAACCTCCTGGCTCCAGTAGC 

CGGCCATCCTCAGGACAGGATCTTTTTCTTCTTCCTTCAGATCCCTTTGT 

TGATCTAGCAAGTGGCCAAGTTCCTTTGCCTCCCGCTAGAAGGTTACCAG 

GTGAAAATGTCAAAACTAACAGGACATCACAGGACTATGATCAGCTTCCT 

TCATGTTCAGATGGTTCACAGGCACCAGCCAGACCCCCTAAACCACGACC 

GCGCAGGACTGCACCAGAAATTCACCACAGAAAACCCCATGGGCCTGAGG 

CGGCATTGGAAAATGTCGATGCAAAAATTGCAAAACTCATGGGAGAGGGT 

TATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGT 

CGAAGTTGCCCGGAGCATCCTCCGAGAATTTGCCTTCCCTCCTCCAGTAT 

CCCCACGTCTAAATCTATAGCAGCCAGAACTGTAGAC^CCAAAATGGAAA 

GCAATfcGATGTATTCCAAGAGTGTGGAAATAAAGAGAACTGAGATGGAAT 

TCAAGAGAGAAGTGTCTCCTCCTCGTGTAGCAGCTTGAGAAGAGGCTTGG 

GAGTGCAGCXTCTCAAAGAAAACCGATGCTTGCTCAGGATGTCGACAGCT 

GTGGCTTCCTTGTTTTTGCTAGCCATTTTTTTAAATCAGGGTTGAACTGG 

AAAAAATTATTTAAAAACGTTTACCTCCCTTGAACTTTGAACCTGGGAAA 
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Human 'CblB protein in 3Gdll4 Translation of cbl-B clone3Gdll4 starting 
at base pair 3 (SEQ ID NO: 3 98) 

^SDPVLMRKHRRHDLPLEGAK^SSNGHLGSEEYDVPPRLSPPPPVTTLIiPS 
IKCTGPLJU^SLSEKTRDPVEEDDDEYKIPSSHPVSLNSQPSHCHNVKPPV 

"rscdnghcmlngthgpssekksnipdlsiylkgedafdalppslpppppp 

ARHSLIEHSKPPGSSSRPSSGQDLFLLPSDPFVDIASGQVPIiPPAR^PG 
ENVKTNRTSQDYDQLPSCSDGSQAPARPPKPRPRRTAPEIHHRKPHGPEA 

Human CBL-B Protein sequence - varl (public gi: 4757920) (SEQ ED NO: 227) 

MANSMNGRNPGGRGGNPRKGRI LGI IDAIQDAVGPPKQAAADRRTVEKTW^ 

ppYILDILPDTYQHLRLILSKYDDNQKLAQLSENEYFKIYIDSLMKKSKRAIRLFKEGK^ 

l^TKLSLIFSHMLAEIKAIFPNC^FQGDNFRITKADAAEFWR^^ 

LEAMALKSTIDLTCNDYI SVFEFDI FTRLFQPWGS ILRNWNFLAVTHPGYMAFLTYDEVKARLQKYSTKP 
GS YI FRLSCTRLGQWAIGYVTGDGNI LQTI PHNKPLFQALIDGSREGFYLYPDGRS YNPDLTGLCEPTPH 
DHI KVTQEQYELYCEMGSTFQLCKI CAENDKDVKI EPCGHLMCTSCLTAWQESDGQGCPFCRCE I KGTEP 

iivdpfdprdegsrccsiidpfgmpmldldddddreeslmmnrlanvr^^ 
kpqpdplqiphlslppvpprldliqkgivrspcgsptgspksspcmvrkqdkplpapppplrdppppppe 

RPPPIPPDNRLSRHIHHVESVPSiyDPPMPLEAWCPRDVFGTNQLVGCRLLGEGSPKPGITASSNVNGR^ 
RVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDVPP^ 

PVEEDDDEYKI PSSHPVSLNSQPSHCHNVKPPWSCDNGHCMLNGTHGPSSEKKSNI PDLSI YLKGTYRI 

Human CBL-B Protein sequence - var2 public gi: 23273909) (SEQ ID NO: 228) 

MANSMNGRNPGGRGGNPRKGRI LGI I DAI QDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQLKNS 

PPYILDILPDTYQHLRLILSKYDDNQKIAQIiSENEYFKIYIDSLM^ 

RNLTKLSLIFSHMLAEIKAIFPNGQFQGDNFRITKADA^ 

LEAMALKSTIDLTCNDYI SVFEFDI FTRLFQPWGS I LRNWNFLAVTHPGYMAFLTYDEVKARLQKYSTKP 
GS YI FRLSCTRLGQWAIGYVTGDGNI LQTI PHNKPLFQALIDGSREGFYLYPDGRS YNPDLTGLCEPTPH 
DHIKVTQEQYELYCEMGSTFQLCKI CAENDKDVKIEPCGHLMCTSCLTAWQESDGQGCPFCRCEI KGTEP 
I IVDPFDPRDEGSRCCS I IDPFGMPMLDLDDDDDREESLMMNRLANVRKCTDRQNSPVTS PGS SPLAQRR 
KPQPDPLQIPHLSLPPVPPRLDLIQKGIVRSPCX3SPTGSPKSSPCMVRKQDKPLPAPPPPLRDPPPPPPE 
RPPP I P PDNRLSRHIHHVESVPSKDPPMPLEAWCPRDVFGTNQLVGCRLLGEGS PKPGI TAS SNVNGRHS 
RVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDVPPRLSPPPPWTLLPSIKCTC 
PVEEDDDEYKI PSSHPVSLNSQPSHCHNVKPPVRSCDNGHCMLNGTHGPS S EKKSNI PDLS I YLKGDVFD 
SASDPVPLPPARPPTRDNPKHGSSLNRTPSDYDLLIPPLGEDAFDALPPSLPPPPPPARHSLIEHSKPPG 
SSSRPSSGQDLFLLPSDPFVDLASGQVPLPPARRLPGENVKTNRTSQDYDQLPSCSDGSQAPARPPKPRP 
RRTAPEIHHRKPHGPEAALENVDAKI AKLMGEGYAFEEVKRALE I AQNNVEVARS ILREFAFP PPVS PRL 
NL 

Human CBL-B Protein sequence - var3 (public gi: 862407) (SEQ ID NO: 229) 

MANSMNGRNPGGRGGNPRKGRI LGI IDAI QDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQLKNS 
PPYILDILPDTYQHLRLILSKYDDNQKLAQLSENEYFKTYIDSLMKKSKRAIRL 

RNLTKLSLI FSHMLAEI KAI FPNGQFQGDNFRI TKADAAEFWRKFFGDKTIVPWKVFRQCLHEVHQISSS 
Id^AMALKSTIDLTCNDYI SVFEFDI FTRLFQPWGS I LRNWNFL^ 

GSYI FRLSCTRLGQWAIGYVTGDGNI LQTI PHNKPLFQALIDGSREGFYLYPDGRS YNPDLTGLCEPTPH 
DHIKVTQEQYELYCEMGSTFQLCKI CAENDKDVKI EPCGHLMCTSCLTAWQESDGQGCPFCRCEIKGTEP 
I IVDPFDPRDEGSRCCS I IDPFGMPMLDLDDDDDREBSLMMNRLANVRKCTDRQNSPVTS PGSSPLAQRR 
KPQPDPLQIPHLSLPPVPPRLDLIQKGIVRSPCGSPTGSPKSSPCMVRKQDKPLPAPPPPLR^ 
RPPPIPPDNRLSRHIHHVESVPSRDPPMPLEAWCPRDVFGTNQLVGCRLLGEGSPKPGITASSNVNGRHS 
RVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDVPPRLSPPPPVTTLLPS I KCTGPLANSLSEKTRD 
PVEEDDDEYKI PSSHPVSLNSQPSHCHNVKPPVRSCDNGHCMLNGTHGPSSEKKSNI PDLSI YLKGDVFD 
SASDPVPLPPARPPTRDNPKHGSSLNRTPSDYDLLIPPLGEDAFDALPPSLPPPPPPARHSLIEHSKPPG • 
SSSRPSSGQDLFLLPSDPFVDLASGQVPLPPARRLPGENVKTNRTSQDYDQLPSCSDGSQAPARPPKPRP 
RRTAPE IHHRKPHGPEAALENVDAKI AKLMGEGYAFEEVKRALE I AQNNVEVARS I LREFAFPPPVS PRL 
NL 

Human CBL-B Protein sequence - var4 (public gi: 862409) (SEQ ID NO: 230) 

MANSMNGRNPGGRGGNPRKGRI LGI IDAIQDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQLKNS 
PPYILDILPDTYQHLRLILSKYDDNQKIAQLSENEYFKIYIDSI^K^ 
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M^TKLSLIFSHM^IKAI^^ 

LEAMALKSTIDLTCNDYI SVFEFDI FTRLFQPWGSIIiRNWNFIiAVTHPGYMAFLT5fDEVKARLQK3fSTKP 
GSYIFRLSCTRLGQWAIGYVTGDGNIIK^IPHNKPLFQALIDGSREGFYLYPDGRSYNPDIiTGIiCEPTPH 
DHI KVTQEQYELYCEMGSTFQLCKI CAENDKDVKI EPCGHLMCTS CIjTAWQESDGQGCPFCRCE I KGTE P 
I IVDPFDPRDEGSRCCS I IDPFGMPMLDLDDDDDREESIiMMNRIjANVRKCTDRQNSPVTS PGS SPLAQRR 
KPQPDPI^IPHLSLPPVPPRLDLIQKGIVRSPCGSPTGSPKSSPCMVRKQDKPIiPAPPPPLRDPPPPPPE 

RPPPIPPDimLSRHIHHVESVPSRDPPMPLEAWCP^ 
•RVGSDPVLMRKHRRHDLPLEGAKVFSNGHIXSSEEYDVPPRLSPPPPVTTI^ 

PVEEDDDEYKI PS SHPVSLNSQPSHCHNVKPPVRSCDNGHCMIjNGTHGPS SEKKSNI PDLSI YLKGDVFD 
SASDPVPLPPARPPTRDNPKHGSSIjNRTPSDYDLLI pplg 

Unicrene Name; CENTB1 Unigene ID: Hs • 33 7242 

Human CENTB1 mRNA sequence - varl (public gi: 495679) (SEQ ID NO: 37) 

ggggtgagagctcctcctaggacacccctttccccttggggaaagaattgtgcccccaggcccttccccg 

cggaggtccctctcctccttccccctcatctccccttcctgggacagaaagtgcctccacctgcatcccc 

aggggcccggcctccagggcccgctggccccacagcaggcaagctgagatgacggtcaagctggatttcg 

aggagtgtctcaaggactc^ccccgtttccgagcctctattgagctggtggaa 

ggagacccgtctggaaaagctcctgaaactgggcactggtctcctggaaagtgggcgccattaccttgct 

gccagccgcgccttcxsttgtcxsgcatttgtgacctggcccgcctgggtcc^ccagagcccatgato^ 

agtgtctggaaaaattcaccgtgagcctgaacc^caagctggacagccatgcggagcttctagatgcc^c 

cc^c^cacactgc^gcagcagatccagaccctggtcaaggaaggtctgcggggtttccgagaggctc^ 

cgggatttctggcggggggctgagagcctggaggctgccctgacccacaacgcagaggttcccaggcgcc 

gggcccaggaggcagaagaggcaggagctgctttgaggacggctcgagctgggtaccggggacgggcact 

ggattatgccctgc^gatcaacgtgattgaggacaagagga^ 

ttggtggaggcccaggctacccatttcc^gcagggccatgaggagctgagccggctgtcccagtatcgaa 

aggagctgggcgcccagttgcaccagctggtcttgaattcagcacgagagaagagggacatggagcagag 

acacgtgctgctgaaacagaaggagctgggtggggaggagccagaaccaagcttaagagaggggcctggt 

ggcctogtgatggaaggacatctcttcaaacxsggc 

tcaccattcagagcaacc^ctggtttacc^^ 

ccttcgtctctgca(^gtgaaactctgccctgactcagaaaggcggttctgctttgaggtggtgtccacc 

agc^gtcctgcctcctccaggctgactcagagcgcctcctgcagctgtgggtcagtgct 

gcattgcttctgccttcagtcaggctcgccttgatgacagcccccgggg 

cctggccataggctctgctgcc^ccctgggctctggtggaatggccaggggaagggagcctgggggagtc 

gggcacgtggtggcccaggtccagagtgtggatggcaatgcccagtgctgcgactgccgggagccagccc 

cggagtgggccagcatcaaccttggtgtcaccctctgcattcagtgttccggcatccacaggagccttgg 

tgttcacttctccaaagtccggtctctgacccttgactcatgggagccagaactagtgaagctcatgtgt 

gagctgggaaatgtcatcatcaaccagatctatgaggcccgcgtggaggccatggcagtgaagaaacca 

ggcccagctgctcccx3gcaggagaaggaggcctggattcacgctaaatacgtggagaagaagttcctgac 

caagctgcctgagattcgagggcgaagaggtggccgggggcgcccaagggggcagcctcctgtgccccca 

aagccttcc^tcaggccccggcc^gggagcttgagatccaagccagagcccccctctgaggacctgggaa 

gcctgcaccctggggccctactgtttcgagcgtotgggcatcctccatctcttcccaccatggctgato 

ccttgcccatggagctgatgtcaactgggtcaatgggggccaagataatgccacaccgctgatccaggcc 

AGAGCTGCTAATTCTCTTCTGGCCTGTGAGTTTCTCCTCCAGAACGGGGCGAACGTGAACCAAGCGGACA 
GTGCGGGCCGGGGCCCGCTGCACCACGGAACCATTC 

ACGGGGAGCTGATCTGGGGGCTCGAGACTCTGAAGGCAGGGACCCTCTGACCATCGCCATGGAAACAGCC 

AACGCTGACATCGTCACCCTGCTACGACTGGCAAAGATGAGGGAGGCTGAAGCGGCCCAGGGGCAGGCAG 

GAGATGAGACGTATCTTGACATCTTCCGCGACTTCTCCCTCATGGCGTCAGACGACCCGGAGAAGCT 

CCGTCGCAGTCATGACCTCCACACGCTGTGACCCGAGGCCCACGGGGCCCGCGCCTGCCTCCCTTCCCCG 

CCACCGGGCCCTCTGCCATTAAAGCCTCCGTGCTTCGCTCTTCC 

Human CENTB1 mRNA sequence - var2 (public gi: 173912 88) (SEQ ID NO: 38) 

GAGCTCCTCCTAGGACACCCCTTTCCCCTTGGGGAAAGGATTGTGCCCCCAGGCCCTTCCCCGCGGAGGT 
CCCTCTCCTCCTTCCCCCT<^TCTC^ 

CGGCCTCCAGGGCCCGCTGGCCCCACAGCAGGCAAGCTGAGATGACGGTCAAGCTGGATTTCGAGGAGTG 
TCTCAAGGACTCACCCCGTTTCCGAGCCTCTATTGAGCTGGTGGAAGCCGAAGTGTCAGAATTGGAGACC 
CGTCTGGAAAAGCTCCTGAAACTGGGCACTGGTCTCCTGGAAAGTGGGCG 

GCGCCTTCGTTGTCGGCATTTGTGACCTGGCCCGCCTGGGTCCACCAGAGCCCATGATGGCGGAGTGTCT 
GGAAAAATTCACCGTGAGCCTGAACCACAAGCTGGACAGCCATC 

ACACTGCAGCAGCAGATCCAGACCCTGGTCAAGGAAGGTCTGCGGGGTTTCCGAGAGGCTCGCCGGGATT 

TCTGGCGGGGGGCTGAGAGCOTGGAGGCTGCCCTGACCCACAACGCAGAGGTTCCC^ 

GGAGGCAGAAGAGGCAGGAGCTGCTTTGAGGACGGCTCGAGCTGGGTACCGGGGACGGGCACTGGATTAT 
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CTGCTGAAACAGAAGGAGCTGGGTGGGGAGGA 
TGATGGAAGGACATCTCTTCAAACGGGCCAGC 

CTCTCCA^GTGAAACTCTGCCCTGACTCAGAAAGGCGGTTCT 

cct^ctcctcc^ggctgactcagagcgcctcctgcagctgtgggtc^gtgctgtgcagagcag^^ 
ttctgccttc^gtcaggctcgccttgatgacagcccccggggtc^ggccagggctcaggacacctggcc 

ataggctctgctgccacc^^ 

tggtcgccc^ggtcc^^^ 

ggccagcatcaaccttggtgtcaccctc 

TTCTCCAAAGTCCaGTCT^^ 

GAAATGTCATCATCAACCAGATCTATGAGGCCCGOT^ 

ctgctcccggcaggagaaggaggcctggatt^^ 

cctgagattcgagggcgaagaggtggccgggggcgcccaagggggcagcctcctgtgcccccaaagcctt 

CCATCACK3CCCCGGCCAGGGAGCTTGAGATCC^GCCAGAGCCCCCCTCTGAGGACCTGGGAAGCCTGCA 
CCCTGGGGCCCTACTGTTTCGAGCGTCTGGGCATCCTCCATCT 

CATGGAGCTGATGTCAACTGGGTCAATGGGGGCCAAGATAATGCCACACCGCTGATCCAGGCCACAGCTG 
CTAATTCTCTTCTGGCCTGTGAGTTTCTCCTCCAGAACGGGGCGAACGTGAACCAAGCGGACAGTGCGGG 
CCGGGGCCCGCTGCACCACGCAACCATTCTTGGCCACACGGGGCTCGCCTGCCTGTTCCTGAAACGG^A 
GCTGATCTGGGGGCTCGAGACTCTGAAGGCAGGGACCCTCTGACCATCGCCATGGAAACAGCCAACGCTG 
ACATCGTCACCCTGCTACGACTGGCAAAGATGAGGGAGGCTGAAGCGGCCCAGGGGCAGGCAGGAGATGA 
GACGTATCTTGACATCTTCCGCGACTTCTCCCTCATGGCGTCAGACGACCCGGAGAAGCTGAGCCGTCGC 

AGTCATGACCTCCAC^^ 

GCCCTCTGCCATTAAAGCCTCCGTGCTTCGCTCAAAAAAAAAAAAAAA 

Human CENTB1 mRNA sequence - var3 (public gi: 34533014) (SEQ ID NO: 39) 

ATGTCAGCGTTGGCTGTTTCCATGGCGATGGTCAGAGGGTCCCTGCCTTCAGAGTCTCGAGCCCCCAGAT 
CAGCTCCCCGTTTCAGGAACAGGCAGGCGAGCCTGGAGAGAAGAGCCAGGGTC^GCCGGCCGCCC^ACTT 
CTCCC^GCCTTCCTCCCC^TGCCATC^TCCCTACCCCGTGTGGCCAAGAATGGTTGCGTGGTGCAGCGGG 
CCCCGGCCCGCACTGTCCGCTTGGTTCACGTTCGCCCCGTTCTGGAGGAGAAACTCACAGGCCAGAAGAG 
AATTCTGCATGGAGAAGTCGAGAAGGGGGGTTGAGGGTGGCATCCCTAGTGGTGGATTTCAAGATGTCTT 
AGGGTGGCGCCAGTTCAGAGAATGGGAGGGTGGAGTGTGGTAATCAGGAGTGTGGAAGGGGTTACAGCTA 

ACTGTAACCAAGCTAGGCTTGGCTCTAGCTCT 

TGAGGTATGTTACTATTTTACAGATGAGGCTGAGAGGTTAATAACTTGTTAAAAGTCTCCTGTAGGCCGG 

GCACAGTGGCTCACGCCAGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACAGGGTCAGGAGA 

TCCAGACCATCCTGGCTAGCACGGTGGAGCCCTATCTCTACTAACAATACAAGAAATTAGCCGGGCATGC 

TGGCTGGCGCCTGTGGTCCCAGCTACTCGGGCAGCTGAGGCAGGAGAATGGTGTGAACCCGGGAGGCGGA 

GCTTG(^GTGAGC03AGATCG(^CCATTGCACTCCGGCCTGGGGGACGGAGCGAGACTGTCTCAA^ 

;y\AAAAAAGTCTCCTGTAAGAGGTGAGAGCCTGGGTTCAAACTCAGGTTCTCTGCCTCCAAATCACACAC 

TCTTAGCAACCAGTCTCTATTGTTGATCTCT 

AGGAGGTAAGGGCAGGGCCCAGAGTCAGGAGTAGGTGTCAGAGCCCTAGGGTGGGGTGGAGAGGTCAGCA 
GGGCTCTTACAGCAGCTGTGGCCTGGATCAGCGGTGTGGCATTATCTTGGCCCCCATTGACCCAGTTGAC 
ATCAGCTCCATGGGCAAGGGCATCAGCCATGGTGGGAAGAGATGGAGGATGCCCAGACGCTCGAAACAGT 
AGGGCCCCAGGGTGCAGGCTTCCCAGGTCCTCAGAGGGGGGCTCTGTTCGGGGGATTTGGTTCTGTTAGG 
GGGAAGCAGCTCCGAGTCTGGGAAGAAAACCCTCAGCAGTGTCCCAATGCTATAATGGGACAGGTCTCTT 
CTAAATGATGGGGAGCTTGGGACTGTGGAGGGAATAGAGTGATGCAAGTGTGGGTATGTGTAAGTATGCG 
TATGCATGTGTACGAGTCCCTAGGGTGTGGGGGAGAGACGGCATCATCACCTCATCTGGTCCAACCACAC 

TTGGCCTCAGCTCTC^CCCCT<^CG 
TTGGCACCCCTTACCCTCCCTGCCCACGrc 

CGTAACCCCTGAAACGGC^GTCCGGTCCCTCCGACATTGTCCAGCGGAAGGCCTGGGCTTCACACTCTGT 

GCCTCCCGGCGCTACCTGGCACGATGCCGAGCACACAGC^^ 

ACCTGGCTTGGATCTCAAGCTCCCTGGCCGGGGCCTGATC 

CTTGGGCGCCCCCGGCCACCTCTTCGCCCTCGAATCTCAGGCAGCTTGGTCAGGAACTTCTTCTCCACGT 
ATTTAGCGTGAATCCAGGCCTCCTTCTCCTGCCTGTGGGAGGGGAGAAGCACGCAGTCTTCCCTCTTCTG 
CTCCAGGGGTCCCCCATTCCCCTGGGAGGCTAAACCCCAAGCTCACCGGGAGCAGCTGGGCCCTGGTTTC 
TTCACTGCCATGGCCTCCACGCGGGCCTCATAGATCTGGTC^ 

GCTTCAGGAGGCCCAGGCAGAGGCAGAGACAGGGAAGGTGGGGGTGAGTGACTCCTCAGGGATCACGCCC 
CTGCACCGCCATGTCCTTGCCCCACCCCAAGTTCTT^^ 

ACTAGTTCTGGCTCCCATGAGTCAAGGGTCAGAGACCGGACTTTGGAGAAGTGAACACCAAGGCTCCTGG 
AGGGCCAGAGGGGGAGGGTCAGGCCCTGTGCAGGGGGGCAGTGGCCTGGGGAGCTGCTGCTGCTCCTGAA 
GACACTGGGAGGCAAGGCTGGCATGGGGGCCCGTGCAGAGGTGCTGGCCCAGGAGGCAGGGCAGCTGCGG 
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rr AT GTAACCGCCATGTAGCCTTGACCTG^ 
AGGTTTCATTTCAAGGCCCTCATCACTCCAGCC^ 

T^ACAACCTCTCCCATGTCACCTCCCTTCC^CCACACTGAGGTGGGGGGCGAGGGCCTT 

TOAGGfcCTGATGACCGTTTCTCTGCCTAG 
OTCTTCCAACCAAGCCAACAAATTC^ 

• GGCCCCTCCTGGGTCC^^ 

• GCTAAGAGAGGAAGCCAGGACAGGGAAGGAAGAGGGGCCGGTGTGATGCGCTCTGTATTGGAGCCGCACT 
^GTGGCCCGAAGGAGTGGGGCTCCCGCATGGGCCTTGTGGAGTAACCTGTGGATGCCGGAACACT 

AGAGGGTGACACCAAGGTTGATGCTGGCCCACTCCGGGGCTGGCTCCCGGCAGTCGCAGCACTGGGCAT^ 
GCCATCC^CACTCTGGACCTGGGCC^CCACGTGCCCGACTCCCCC^GGCTCCCrrTCCCCTGGCC^TTC^ 
CCAGAGCCCAGGGTGGCAGCAGAGCCTATGGCCAGGTGTCCTGAGCCCTGGGGGAGAGAGGGGAAGA^G 
GGTGGCCAAGGGGCCTAGGGTAAAGGGTGCCCCATOTCCACAGGCAGCCTGGCTCCGCACCCCCAGGCTA 

AGGTACCTGGCCTGGACCCCGGGGGCTGTCATCAAGGCGAGCCT^ — 
TGCACAGCACTGACCCACAGCTGCAGGAGGCGCTCTGAGTCAGCCTGGAGGAGGCAGGACCTAGGTAGGA 

GGGTGAGGGAGATGGCAGAGGGGTCTGAGGCCTGGGAAGCAAAGTGGCAGCATGGGCAGACTGACATTCTV 

GCCAGTATTCAACCAGTTCCAGTTGCATTGAAAGACTTCTGTACCAGTTGGTAATATTCTCCTAAATATC 

CCCCATCACCCTGTACCCTCTTCCACAATGGCCCCCCAGTCCAGCCGCCAA 

GAGCTGCATGGGGGGCTTCCATTOTGGTGGGCCCTGCCm 

GTATCACCCCTGGGGATTGCACTCACTTGCTGGTGGACACCACCTCAAAGCAGAACCGCCTTTCTGAGTC 
AGGGCAGAGTTTCACTGTGCAGAGACGAAGGTCATCCACCACCACAGTCACAGGGTCCTGGCAGGATAAG 
GTGATAAGGGGCCAGATGTCCAGCTGCAGGCAAGAGCTGAGTCTCCCTGGGGCCCAGGCATCCAGGACCC 
AGGTCCACTCACCTTGTACTTCTTCTGGTAAACCAGTTGGTTGCTCTGAATGGTGAACCAGCGTCTGTAA 
GAGAAGGAAATCATTACAGACATAGGCAGCTTTAGGATGAGGGACGGAAGAGAGGCTGTGCTTTTTGCCC 
ATGAGGATCTTACTGAGAGGACAGACACCTGGGCTGACTGTTCCACX^GACATTCCAGAGAAGGGTGGAC 
AATTGTGCAGATTGGAACATCTAAAGGATGCTATTCCTATCTTGGACAACCCAGATTTGATATAGTTATG 
AAGACAACTTTCC^GCAGATGGCAGTAAAATTCTTTTTCTAATAAAATGTCTATTGCTAC^TTTAAAAA 
ATACTATTTAGGCTGGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGATGGGGGTGGTGGATCGCC 
•CGAGGTCAGGAGTTTGAGACCACCCTGACCAATATGGTGAAACTCCGTCTCTACTAAAAATACAAAAATT 
AGCCAGGCGTGGTGGCAGGCGGCTATAATCCCACCTACTTGGGAGGCTGAGGCGGGAGAATCGCTTGAAC 
CCAGGAAGCTGAGGTTGCAGTGAGCTGGGATCGCACCACTGTGCTGCAGCCTGCGCAACATAGCGAGGCT 
CCATGAAAAAAGAAAAAAAAAAGAAAAAGAAAAAAAGAAAAGAAAGAATCTO 

CTCACGCCTGTAGTCCCAGCAAGTTGGGAGGCCGAGGCGGGTGGATTGCTTGATGTCAGGAGTTTGCAAC 
CAGCCTGGGCAACATGGTGAAACCCTGTTTCTACCAAAZ^TACAAAAATTAGCCGAGCGTGATGGCACGC 
GCCTGTGGTCCCAGCTGTTTAGGATGCTGAGGAGGGAGGATCACTTGAACTCAGGGGATAGAGGTTGCAG 
TGAGCCGAGACTGCGCCACTGCACTGCAGGCTGGGCAACAGAGTGACACCCCATCTCAAAAAW 

Human CENTB1 mRNA sequence - var4 (public gi: 32 879918) (SEQ ID NO: 40) 

ATGACGGTCAAGCTGGATTTCGAGGAGTGTCTCAAGGACTCACCCCGTTTCCGAGCCTCTATTGAGCTGG 
TGGAAGCCGAAGTGTCAGAATTGGAGACCCGTCTGGAAAAGCTCCTGAAACTGGGCACTGGTCTCCTGGA 

CCACCAGAGCCCATGATGGCGGAGTGTCTGGAAAAATTCACCGTGAGCCTGAACCACAAGCTGGACAGCC 
ATGCGGAGCTTCTAGATGCCACCCAACACACACTGCAGCAGCAGATC 

GCGGGGTTTCCGAGAGGCTCGCCGGGATTTCTGGCGGGGGGCTGAGAGCCTGGAGGCTGCCCTGACCCAC 

AACGGAGAGGTTCCCAGGCGCCGGGCCCAGGAGGGAGAAGAGGCAGGAGCTGCTTTGAGGACGGCTCGAG 

CTGGGTACCGGGGACGGGCACTGGATTATGCCCTGCAGATCAACGTGATTGAGGACAAGAGGAAGTTTGA 

CATCATGGAGTTTGTGCTGCGTTTGGTGGAGGCCCAGGCTACCCATTTCCAGCAGGGCCATGAGGAGCT 

AGCCGGCTGTCCCAGTATCGAAAGGAGCTGGGCGCCCAGTTGCACCAGCTGGTCTTGAATTCAGCACGAG 

AGAAGAGGGAC^TGGAGCAGAGAC^a3TGCTGCTGAAAQ\GAAGGAGCTGGGTGGGGAGGAGCCAGAACC 

AAGCTTAAGAGAGGGGCCTGGTGGCCTGGTGATGGAAGGACATCTCTTCAAACGGGCCAGCAACGCATTT 

AAGACCTGGAGCAGACGCTGGTTCACCATTCAGAGCT^CCAACTGGTTTACCAGAAGAAGTACAAGGACC 

CTGTGACTGTGGTGGTGGATGACCTTCGTCTCTGCACAGTGAAACTCTGCCCTGACTCAGAAAGGCGGTT 

CTGCTTTGAGGTGGTGTCCACCAGCAAGTCCTGCCTCCTCCAGGCTGACTCAGAGCGCCTCCTGCAGCTG 

TGGGTCAGTGCTGTGCAGAGCAGCATTGCTTCTGCCTT 
GTCCAGGCCAGGGCTCAGGAC^CCTGGCGATAGGCTCTGCTGC^ 

GGGAAGGGAGCCTGGGGGAGTCGGGCACGTGGTGGCCCAGGTCCAGAGTGTGGATGGCAATGCCCAGTGC 
TGCGACTGCCGGGAGCCAGCCCCGGAGTGGGCCAGCATCAACCTTGGTGTCACCCTCTGCATTCAGTG 
CCGGCATCCACAGGAGCCTTGGTGTTCACTTCTCCAAAGTCCGGTCTCTGACCCTTGACTCATGGGAGCC 
AGAACTAGTGAAGCTCATGTGTGAGCTGGGAAATGTCATCATCAACCAGATCTATGAGGCCCGCGTGGAG 

GCCATGGCAGTGAAGAAACCAGGGCCGAGCTGCTCCCGGC^ 
ACGTGGAGAAGAAGTTCCTGACCAAGCTGCCTGAGATTCGAGC^ 

GGGGCAGCCTCCTGTGCCCCCAAAGCCTTCCATCAGGCCCCGGCCAGGGAGCTTGAGATCCAAGCCAGAG 
CCCCCCTCrGAGGACCTGGGAAGCCTGCACCCTGGGGCCCTACTGTTTCGAGCGTCTGGGCATCCTCCAT 

CTCTTCCCACCATGGCTGATGCCCTTGCCCATGGAGCTGATG 
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TGCCACACCGCTGATCCAGGCC^^ 

GCGAACGTGAACCAAGCGGACAGTGCGGGCCGGGGCCCGCTGCACCArc 
C^GACGACCCGGAGAAGCTGAGCCGTCGCAGTCATGACCTCCACACGCTGTAG 

EAAQGQAGDETYLDI FRDFSLMASDDPEKLSRRSHDLHTIi 

w.n-nan PFNTBl Droleinseauence- var2 (public gi: 34533 015) (SEQ ID NO: 232) 
p^sawSpfwri^sqarrefcmeksp^gveggipsggfqdvlgwrqfreweggvw 

Sgag^ 

GAATCGTAGATACTGAA^CCCCGCAAGTTC^CTTCAACTGTGCATTCGTGC 

Human CENTB1 pray sequence - var2 (SEQ ID NO: 42) ■ 
cSgSccttccm^ 



CAACCGATGGNCTGATGCCCTTTCGCGCCATGGTAGCTTGATGTCAACCTAGGT^ 

cctnaaggataaattgctcgtcgaccagacobgctatccaaaggcac^ataatctagctaattcgttacg 



TTCTTGG 



gSgSgSSSgtc^tggggg^ 

CTCTTCTGTCCT^ 

rrrocTOCACCACGCAACCATTCTTGGCCACACGGGGCTCGCCTGCCTGTTCCTGAAACGGGG^ 
C^GGGGGCTCGAGACTCTGAAGG^ 

TCACCCTGCTACGACTGGCAAAGATGAGGGAGGCTGAAGCGGCCCAGGGGCAGGCAGGAGATGAG^ 
TC^CMCTTCCGCGACTTCTCCCTCATGGCGTCAG^ 

TGNCATNAAAGCCTNCGNGCTTCNAAAAAAAAAAAAAAAAAAAA 

Human CENTB1 pray sequence - var4 (SEQ ID NO: 44) 

CCGGCCATGGAGTACCATACGACGTACAGTATTACAGCTACATATGGCCATGGAGGCCAGTGAATTCCAC 
CGCAATGCAGTGGTATCAACGCATGCAGATGGACCATTATGGCCGGGGTGGGCACGTCCGTCCATGATGT 
CCCCCAAAGGCCTTCCATCAGGCCCCGTGGCAGAGGAGGCTTGAGATCCAAGCCAGAGCCCACCCTCTGA 
GGACCTGGGAAGCCTGCACCCNGGCGGCCCTACTGTTTCGAGCGTCTGGGACATACTCCATCATCTTCCC 



Figure 36 part - 22 



p C T/US0«*/0630B 



======= 

GCTGATCTGGGGCGCTCGACGACTCTGAAG 



TGGAGAAGAAGTTCCTGACCAAGCTGCCGTGAGAATTCGAGGGCGAAGAGGTC 
GGGCAGCCTCCTGTGCCAGCCCTAAAGCCTTCCATCATGGCCCCGCGTCCAAGGAGCTTGAGATC 

^AGTAGGCCCCCTCTGACGGACCTAGGGAAGCCTGCTACCC^ 

TGGGCATCCTCCATCTCTTTCCACCATGGCCTGATGCCCTTGCCCATOT 

TGGGTGGCCAAGATATATGCCACACCGCTGATCQ 

NTNTTCTCCTCCAGAACGGTGGCGACACGTGAACCCAAGCGGNCAGTGCCGGC 

Human CENTB1 pray sequence - var6 (SEQ ID NO: 46) ™,™„™v* 

GGCCATGGAGTACCATACGACGTACAGATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCACCGCAA 
GCAGTGGTATCAACGCATGAGATGGACCATTATGGGGGGCAGTGCCATGGGCAGCTGAAGAAATCCANGC 
CCAGCTGCTCCCGGCAGGAGAAGGAGGCCTGGATTCACGGCATAATAGTAGCAGCTGGAGTAAGAAGTTC 
CTGTAXCCAAGTCTGCCCTGACGAATTCGAGGTGGCGAAGTATGGTGGCCGGGGGCAGTCCTCGAAGGAG 
GGTCAGCCACTCCTGGTGCCGCCACGAACATGCCCGTTTCCATACACGCGTCCCCGGCCCACGGGATGGC 
ATTGAGATCCACATGCACAGAGCCCCGCCTCTGAGGACCTGTGAGCAAGCTCATGGCAACCCTGGGGACC 
CTAGCGTAGTATTCTGAGCCAGTCTGGGCAATCGCTTCACATCTCTTCTCCACGCATGAGCATGATGCGC 
GCTTTGACCCATGGAGCTAGATGTGAACTGGGTCAATGGGGTGCGAA 

CAAGGCCTACAGCTGCTAACGTTCCTCTTCTGGCCTGTGAGTTTCTCCTCTCAGAACGGGGCGAACTGTG 
^AGCCC^AGCGTGACAGTGCGGGCCCGGGGCCCGACTGCGCCACGCAATCCACTTCTTGGCCNGCAACNT 

GGGCTCGNCTTGCCTTGTTTCCTTGATCAC 

Human CENTB1 pray sequence - var7 (SEQ ID NO: 47) 

CNCGGCATGGAGTACCATACGACGTACAGATTACGCTGATATGGCCATGGAGGCT^GTGAATTCCACCC^ 

GCAGTGGTATCAACGCATGAGTGGACCATTATGGGGGAAGCTCATGTGTGAGCATGGGAAATAGTCATCA 

TCAACCAAGATCTATGAGGCCCGCGTGGAGGCCATGGCAGTGAAGAAACCAGGGCCCAGTCTGCTCCCGG 

CAGGAGAAGGAGGCCTGGATTCACGCTAAATACGTGGAAGAAGAAGTTCCrGACCAAGCTGCCTGA 

CX3ATGGCGANGAGGTGGCCGGGGGCGCCCAANGGGGCAGNCTCCTGTGCCCCCAAAGCCTTCCATCAGGC 

CCCAGGCGCAGGGAGCTTGAGATCCAATGCCAGAGCCCCCCGTCTGAGGACCTGGGAAGCCTGCACCCTG 

GGGCCCTACTGTTTCGAGCGTCTGGGCSVrCCT^ 

AGCTGATGTCAACTGGGTCAATCaJGGCGGCCAAGATAATGCCATCACCGACTGATCCAGGCCACAGCCTG 
CTAANTTCTACTTCTGGCCGTGTGAGTTTCTCCTCCAGGAACGGGGCGAACCGTGGACCAAGGCGGACNN 
GTGCGGGCCGGGGCCCGCTGCCACCACGCCAACCATTCTTGGCATACGGGGCTCGCCT 

Unigene Name: DDEF1 Unigene ID; Hs . 386779 

Human DDEF1 mRNA sequence - varl (public gi: 31873727) (SEQ ID NO: 48) 

GAGACAAAGTTTACAAAAATTGAGAAAGAGAAAAGAG 

AGATAACAGGAGCTGAGATTGCGGAAGAAATGGAGAAGGAAAGGCGCCTCTTTCAGCTCCAAATGTGTGA 
ATATCTGATTAAAGTTAATGAAATCAAGACCAAAAAGGGTGTC 
TACCATGCACAGTGCAATTTCTTTCAAGATGGCTTC 
AACTGGCTGCTGATTTATATAATATAAAACAGACCCAGGATC 

AGACTTAATAAAATCCTCT CTTCAACTGGATCAGAAAGAAGATT CTCAGAGCCGGCAGGGAGGATACAGC 
ATGCATCAGCTCCAGGGCAATAAGGAATATGGCAGTGAAAAGAAGGGGTACCTGCTAAAGAAAAGTGACG 
GGATCCGGAAAGTATGGCAGAGGAGGAAGTGTTCAGTCAAGAATGGGATT CTGACCATCT CACATGCCAC 
ATCTAACAGGCAACCAGCCAAGTTGAACCTTCTGACCTGCGAAGTAAAACCTAATG 
TCTTTTGACCTGATATGACATAATAGAACATATC 

GGATATCAGTATTGACAAATAGCAAAGAAGAGGCCCTAACCATGGCCTTCCGTGGAGAGCAGAGTGCGGG 

AGAGAACAGCCTGGAAGACCTGACAAAAGCCATTATTGAGGATGTCC^VGCGGCTCCCAGGGAATGACATT 

TGCTGCGATTGTGGCTCATCAGAACCCACCTGGCTTTC^CCAACTTGGGTATTTTGACCTGTATAGAAT 

GTTCTGGC^TCCATAGGGAAATGGGGGTTCATATCTCTCGCATTCAGTCTTTGGA 

AACTTCTGAACTCTTGCTGGC(^^GAATGTAGGAAACAATAGTTTT^ 

CCC^GCCCCTCACCAAAACCCACCCCTTCAAGTGATATGACTGTA^ 

ATGTAGATCATAGGTTTTCAAGGAAGACCTGTTCAACTTCATCAGCTAAACTAAATGAATTGCTTGAGGC 
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PCT/USOt/06308 

CCCCAAACCACAGCTGGGAGACCTGCTAGCAAAATCCCAGAC^GGAGATGTCTCACCCAAGG 

======= 

===== 

Sa^c^gttcctaKcaatgcagtattctcaattgtc^ 
SSSacaaaa^ 

Sc^catoactactttgacaaaagtttaaaacacatatga^ 

SSaISStagcSctacattacgcatgcatttagaaac 

Sg^gataagtggaattgtacatt^^ 

ACAATATTTGTTTCTAATCACTG 

?c^gatgtS?^^ 

S^SaaLgattattcaa^ 

gaaSgg^ga^ 

TCC^GTCTTAAGCTCTGTCCACCrrCAAGTCAATTCATAACCAAGTTTTTGAACGCTCCTATG^ 

CTGTGAAAAGCACTCTTCCCTCTCAGTTTTCTTTrCATCCCAGCCATGTTTATCAGATCCTTAAGAACAT 

TGTATTTCAGTCTTTTACATCAGTCTGAATTTTGGAAAAGAATGCyVATAGTTGTACTCCACJWTCAGTGG 

AA^TTCCCTGAGTCCGAGGCTCATGTGTCATTCTGGCACTACATTTGCTTAAATTGCTAmGGCAA 

CAGCACAGAAAACTAATATTTTTAAGCAGAGAATCTTGGCAATGAGTGAGAGATGTTAATTTCACAGAAG 

CACAACTCCCAACCCAACCCTTAGGAAAAGCCCTCTT^^ 

GTTCTGCTTAAGTGGTTGCTATACAAACTTTGAATAGCCACCTAATAA 

GCAAAATATTTTATCAGCTGTTATTGGAAAGTGATTTTAAGQVATTGCTTCCTCAGTGTCA 

TGAATTTCCACACCAAACAGAGCATGAGGAACCAGTTGACATGCTGGGTTGTGACTGGCAGCTCTAGCAG 

CCTCGGTACTGAAGCCACACCAGTGTCCGGATGGAAGTCTGCATCTGAGGTTGCTCAGTCTCCCGGTCAT 

TCATTTACAG&TTTTAACTTGCATTAAAGAGCTGTTC 
AAAATAAACTGGTTTTTTTCAAAAAAAAAAAAAAAAAAAA 

Human DDEF1 raRN A sequence - var2 (public gi: 63 3 o 853) (SEQ ID NO: 49) 

GAAAAGAGAGCACGCAAAACAACATGGGATGATCCGCACAGAGATAACAGGAGCTGAGATTGCGGAAGAA 
ATGGAGAAGGAAAGGCGCCrCTTTCAGCTCCAAATGTGTGAATATCTCATTAAAGTTAATGAAATCAAGA. 
CCAAAAAGGGTGTGGATCTGCTGCAGAATCTTATAAAGTATTACCATGCACAGTGCAATTTCTTTCAAGA 
TGGCTTGAAAACAGCTGATAAGTTGAAACAGTACATTGAAAAACTGGCTGCTGATTTATATAATATAAAA 
GAGACCCAGGATGAAGAAAAGAAACAGCTAACTGCACTCCGAGACTTAATAAAATCCTCTCTTCAACTGG 
ATCAGAAAGAATCTAGGAGAGATTCTCAGAGCCGGCAAGGAGGATACAGCATGCATCAGCTCCAGGGCAA 
TAAGGAATATGGCAGTGAAAAGAAGGGGTACCTGCTAAAGAAAAGTGACX3GGATCCGGAAAGTATGGCAG 
AGGAGGAAGTGTTCAGTCAAGAATGGGATTCTAACCATCTCACATGCCACATCTAACAGGCAACCAGCCA 
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AGAACCC^ 

atggggottcm 

A^AA^CCTGTTCAAC 

TOGCACTAA^CAAGTCTATG CAGAAGGGGTAGAGCTAATGGAAC ^^ G ^^^'^^^^^ ( ^5^^^ . 

ACAGTAAACCTGAGTGTTTGAAGCTTTTGCTC AGGAG CAAGCC CACTGTGGATAT AGTTAACCAGGCTGG 
AGAAACTGCCCTAGAGATAGCAAAGAG 

TCTGGAAAGTTCAATCCACACGTCGACGTAGAATATGAGTGGAATCTTCGACAGGAGGAG^ 

gSat^gatctggatgacaaaccaagcc^ 
cSctcctc^gca^^^ 

ccacggaggctccccctctgcctcctaggaacgcc^^ 
tctaagcacccagacctctagtggcagctccaccctatccaac^gaggcctcct 

ggggtaacgatgggggtccatcctcttcaagtaagactacaaac^gtttgagggactm 

GAGCACCAGTTCTGC1AAAGACTGCCCTT0K3CCCAAGAG 

AGGAAAACAGATC^TCTCTCCCTAGACAAAGCCACC^TCCCGCCCGAAATCTTTCAGAAA 
TGGCAGAGTTGCCACAAAAGCCACCACCTGG^ 

CCAAATTGGAGATTTGCCGCCTAAGCCAGGAGAACTGCCCCCCAAACCACAGCTGGGGGACCTGCCACCC 
AAACCCCAACTCTGAGACTTACCTCCCAAACCACAGATGAAGGACCTGCCCCCCAAACCACAGCTGGGAG 

&CCTGCTAGCAAAATCCGAGACTGGAGATGTCT 

GTCAGACCCATTGGATCTATCCCCAAATGTGCAGTCCAGAGACG 

TCCAACGACCTCACGCCTACTCTGCCAGAGACGCCCGTACCACTC 

ATAAAGTGAGGCGAGTG^GACCATTTATGACTGCC^GGCAGACAACGATGACGAGCTCACATTCATCGA 
GGGAGAAGTGATTATCGTCACAGGGGAAGAGGACCAGGAGTGGTGGATTGGCCACATCGAAGGACAGCCT 
GAAAGGAAGGGGGTCTTTCCAGTGTCCTTTGTTCATATCCTGTCTGACTAG 
ATTGTCCACATCCTTCATGCAAGACTGCTGCCTT 

TTACAGAGTAAGAAACTCATGGAAGGGCCACCTCAGGAGGGGGATATAATGTGTGTTGTAAATATCCTGT 

GGTTTTCTGCCTTCACCAGTATGAGGGTAGCCTCGGACCCGGCGCGCCTTACTGGTTTGCCAA^ 

CTTGGCATCTAGCACTTACATCTCTCTATGCTGTTCTACAAGCAAACAAACAAAAATA 

CTGCTGGCTTTGCAAATAGAAGTGGTCTCCAGCAACCGTTGAAAGGCATAGAATTGACTCTGTTCCTAAC 

AATGCAGTATTCTCAATTGTGTTACTGAAAATGCAACATTAGCAAAGAGGTGGGTTCTGTTTTCCAGGTG 

AAACTTTTAGCTCCATGACAGACCAGCCTGTAGTTATCTGTGTACACAGTTTACAGCTACAAAAACCT 

TTTGGTATTTATTACAGAAAAGTGCTCAGTTAAATGTAAGTGTTATTCCTTCA^ 

CCAAAACTCTTTATGGCATTTTACAATGCACACAGCCT 

CTTATGAGTGCCCGCCCCTGATATATTACCTCATTATGCAAAAAT^ 

CAAAAGTTTAAAACACATATGAAGTTCAAATTTCAGGAACCAAGGACT 

ATTACGCATGCATTTAGAAGCTTACCTGAAATCTGCCTTTTATAAAGGAATAGTATGGATAAGTGGAATT 
GTACATTTTTTAAACTTGATTGCCATTA^ 

CTGG CTTTCTCAAGAGTATGGATTGACATATTGTGTTATGAATGCACATCTCTCAGATGTGTTGAAG CAT 
CCATTGCATCCATTTTTTATTATTTTCTTAGTTTTGTTCTTGGACAAAT^ 

CAAGATGAATTTAAAAGTCAACCCTTCACACAGTTTCCCTACTGTATGTAGAATCCAGGTGCTGAAACC^ 
AGTGTTTCTTTTCCCATGCTCTTTGTTAAACCCCAATTATAGATAATTTTTCC^GTCTTAAGCTCTGTCC 

ACCTTTCAAGTCAATTCATAACCAAGTTTTTGA 

TCTCAGTTTTCTTTTCATCCCAGCCATGTTTATCAGATCCTTAAGAACATTGTATTTCAGTCTTT^ 
CAGTCTGAATTTTGGAAAAGAATGCAATAGTTGTACTCCACAGTCAGTGGAACTGOT 
GCTCATGTGTCATTCTGGCACTACATTTGCTTAAATTGCTATTTTGGCAACAGCACAGAAAACT 
TTTAAGCAGAGAATCTTGGCAATGAGTGAGAGATGTTAATTTCACAGAA 

TTAGGAAAAGCCCTCTTCCATCGTTACAGTGCTCAGTGAATATTAATTTAGTTCTGCTTAAGTGGTTGCT 

ATACAAACTTTGAATAGCCACCTAATAAATAAACCT^ 
TTATTGGAAAGTGATTTTAAGCAATTGCTTCCTCAGT 

AGCATGAGGAACCAGTTGACATGCTGGGTTGTGACTGGCAGCTTTAGCAGCCTCGGTA 
CAGTGTCCGGATGGAAGTCTGCATCTGAGGTTGCTCAGTGTCCCGGTCATTCATTTACACATTTTAACTT 

GCATTAAAGAGCTGTTCnn?TTCTGTGG^ 

AAAAAAT^AAAAAAAAACAAAAACAAAAAAAAAACACAAAAGCTGCATGT C 
GTCTATTCTTTTTCCCCTTTTGCAGCAACTTACACAGCATTTTTAACACCTTTTTTTTCT^ 
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TTCGGTTTTGTTTTCCATCAGGAATTT^ 
AGTAGAAATACCTTTGGTGATCTTGTT^ 

CTGCAATTTTGTACATTATATAGTATTTTGAAGATATGGAACCTTATGA 
CTTTTTCCCCCAGAGGGGAAAGTTATGTTCT^ 

TGCTATTTATTTTTTTATTGCCTAGAACTTCAACATGTTGTATAGGAATCCTGTAGTGCCACTAGTTAAA 

TGCCGAATTCTCATCTGGATGTTACCATCAAACATCAGTACACTTGTCATTTCACATC^ 

CAGTTTTTCAGTACTGTATGTGTTAATTTCTACTT 

TTCTCAGTTGATCCC 

Human DDEF1 raRNA sequence - var3 (public gi: 7689053) (SEQ ID NO: 50) 

GATTG CCATTAAAGCAGA^TTATAAGGTTGCAACAATATTTGTTT CTAATCACTGGCTTTCTCAAGAGT 
ATGGATTGACATATTGTGTTATGAATGCACATCT^ 

TATTATTTTCTTAGTTTTGTTCTTGGACAAATTTAAACANNTTAAAAGATTATTCAAGATGAATTTAAAA 

GTCAACCCTTCACACAGTTTCCCTACTGTATGTAGAATCCAGGTGCT 

TGCTCTTTGTTAAACTCCAATTATAGATAATTT 

ATAACCAAGTTTTTGAACGCTGCTATGAATTGCACTGTGAAAAGCACTCTTCCCTCTCAGTTTTCGTT^ 
TCCTGAGCCAGAATCAAAAAAAAAAA 

Human DDEF1 mRNA sequence - var4 (public gi: 16552319) (SEQ ID NO: 51) 

CAGT^CCTTAAGATTGTCCACATCCTTCATGCAAGACTGCTGCCTTCATGTAACCCTGGGCAC^GTGTGT 
ATATAGCTGCTGTTACAGAGTAAGAAACTCATGGAAGGGCCACCTCAGGAGGGGGATATAATGTGTGTTG 
TAAATATCCTGTGGTTTTCTGCCTTCACCAGTATGAGGGTAGCCTCGGACCCGGCGCGCCTTACTGGTTT 

GCCAAAGCCATCCTTGGCATCTAGCACTTAGATCT 

GGAGTATAGGAACTGCTGGCTTTGCAAATAGAAGTGGTCTCCAGCAACCGTTGAAAGGCATAGAATTGAC 
TCTGTTCCTAACAATGCAGTATTCTCAATTGTGTTACTGAAAATGCAACATTAGCAAAGAGGTGGGTTCT 
GTTTTCCAGGTGAAACTTTTAGCTCCATGACAGACCAGCCTGTAGTTATCTGTGTACACAGTTTACAGCT 

rACAAAAACCTACTTTGGTATTTATTACAGAAAAGTG 
ATATTCACTGACCCAAAACTCTTTATGGCATTTT^ 

GATTTATACTGTCTTATGAGTGCCCGCCCCTGATATATTACCTCATTATGCAAAAATAACATATCTTTCA 

TGACTATTTTGACAAAAGTTTAAAACACATATC 
ATTAGCCTCTACATTACGCATGCATTTAGAAGCTTACCTGAAAT 

ATAAGTGGAATTGTACATTTTTTAAACTTGATTGCCATTAAAGCAGAAATTATAAGGTTGCl^ 

TGTTTCTAATCACTGGCTTTCTCAAGAGTATGGATTGACATATTGTGTTATGAATGCAC^ 

GTGTTGAAGCATCCATTGCATCCATTTTTTATTAT 

TAAAAGATTATTCAAGATGAATTTAAAAGTCAACCC^ 

GTGCTGAAACC^GTGTTTCTTTTC 

TAAGCTCTGTCCACCTTGAAGTCAATTCATAACCAAGTCT 

AGCACTCTTCCCTCTCAGTTTTCTTTTCATCCCAGCCATGTTTATCAGATCCTTAAGAACATTGTATTTC 

AGTCTTTTACATCAGTCTGAATTTTGGAAAAGA^ 

CCTGAGTCCGAGGCTCATGTGTCATTCTGGGACTAC^ 

AAAACTAATATTTTTAAGCAGAGAATCTTGGCAATGAGTGAGAGATGTTAATTTCACAGAAGCACAACTC 
CCAACCCAACCCTTAGGAAAAGCCCTCTTCCATCGTTACAGTGCTCAGTGAATATTAATTTAGTTCTGCT 
TAAGTGGTTGCTATACAAACTT TGAATAGC CACCTAAT AAATAAAC CTTG CATGACAAACCTGCAAAATA 
TTTTATCAGCTGTTATTGGAAAGTGATTTTAAGCAA 
CACACCAAACAGAGCATGAGGAACGAGTTGACATGCTGGGTTGTG 

CTGAAGCCACACCAGTGTCCGGATGGAAGTCTGCATCTGAGGTTGCTCAGTGTCCCGGTCATTCATTTAC 

ACATTTTAACTTGCATTAAAGAGCTGTTCTTTTCT 

CTGGTTTTTTTC 

t 

Human DDEF1 mRNA sequence - var5 (public gi: 18088817) (SEQ ID NO: 52) 

GAGCTACAAAAACCTACTOTGGTATTTATTAC^^ 
GCAAAATATTC^CTGACCCAAAACTCTTTATGGCATTTTACAATG 

AAGTGGATTTATACTGTCTTATGAGTGCCCGCCCCTGATATATTACCTCATTATGCAAAAATAACATATC 

TTTCATGACTATTTTGACAAAAGTTTAAAACAC^TATGAAGTTCAAATTTCAGGA^ 

AAAATATTAGCCTCTACATTACGCATGCATTTAGAAGCTTACCTGAAATCTGCCTTTTATAAAGG 

TATGGATAAGTGGAATTGTACATTTTTTAAACTTGATTGCCATTAAAGCAGAAATTATAAGGTTG 

ATATTTGTTTCTAATCACTGG CTTTCTCAAGAGTATGGATTGACATATTGTGTTATGAATGCACAT CTCT 

CAGATGTGTTGAAGC^TCCATTGCATCCATTTTTTATTATTTTCTTAGTTOT 

ACTTTTAAAAGATTATTCAAGATGAATTTAAAAGTCAAC^ 

TCCAGGTGCTGAAACCAAGTGTTTCTTTTCCCATGCTCTTT^ 

AGT CTTAAG C T CTGT C CACCTTC AAGTCAATT CATAAC C AAGTTTTTGAACG C TG CTATGAAT TG CACTG 
TGAAAAGCACTCTTCCCTCTCAGTTTTCTTTT^^ 
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ATTTCAGTCTTTTACATCAGTCTGAATTTTGGAAAAGAATGCAMAGTT 
TGTTCCCTGAGTCCC^ 

CACAGAAAACTAATATTTTTAAGCAGAGAATCTTGGCAATGAGT 
SSd^CC^CCCTTAGGAAAAGCCCTCTTCCATCGTTACAGTG 

aaaS^ . . 

tc^tacacattttaacttgcattaaagagct 

ATAAACTGGTTTTTTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

HumaB DDEF1 raRN A sequence - var6 (Predicted by Proteologics) (SEQ ID NO: 53) 

TTTCGACGCGTGGGTTTTATTCCCTT^^ 
GCCTCAAGAAGGATGGAGGAGCCATCATAAGAGTGACAAGGCT 

AAAGTGAAGAAGTCTGTAAAAGCAATATATAATTCTGGTCAAGATCATGTACAAAATGAAGAAAAC 

CACAAGTTCTTGATAAGTTTGGGAGTAATTTTTTAAGTCGAGACAACCCCGACCTTGGCACCGCGTTTGT 

CAAGTTTTCTACTCTTACAAAGGAACTGTCCACACTGCTGAAAAATCTGCTCCAGGGTTTGAGCCACAAT 

GTGATCTTCACCTTGGATTCTTTGTTAAAAGGAGACCTAAAGGGAGT^ 

TTGACAAAGCCTGGAAAGATTATGAGACAAAGTTTACAAAAATTGAGAAAGAGAAl^AGAGAGCACGCAAA 
ACAACATGGGATGATCCGCACAGAGATAACAGGAGCTGAGAT 

CTCTTTCAGCTCCAAATGTGTGAATATCTCATTAAAGTTAATGAAATCAAGACCAAAAAGGGTGTGGATC 
TGCTGCAGAATCTTATAAAGTATTACCATGCACAGTGCAATTTCTTTCAAGATGGCTTGAAAACAGCTGA 
TAAGTTGAAAGAGTACATTGAAAAACTGGCTGCTGATTTATATAATATAAAACAGACCCAGGATGAAGAA 
AAGAAACAGCTAACTGCACTCCGAGACTTAATAAAATCCTCTCTTCAACTGGATCAGAAAGAATCTAGGA 

GAGATTCTCAGAGCCGGCAAGGAGGATACAG CATG CATCAGCTC CAGGG CAATAAGGAATATGGCAGTGA 
AAAGAAGGGGTACCTGCTAAAGAAAAGTGACGGGATCCGGAAAGTATGGCAGAGGAGGAAGTGTTCAGTC 

AAGAATGGGATTCTAACCATOTCACATGCCACATCTAAC 
* GCCAAGTAAAACCTAATGCCGAAGACAAAAAAT CTTTTGACCTGATATCACATAATAGAACATAT CACTT 
TCAGGCAGAAGATGAGC7VGGATTATGTAGCATGGATATCAGTATTGACAAATAGCAAAGAAGAGGCCCTA 
ACCATGGCCTTCCGTGGAGAGCAGAGTGCGGGAGAGAACAGCCTGGAAGACCTGACAAAAGCCATTATTG 
AGGATGTCCAGCGGCTCCCAGGGAATGAC^TTTGCTGCGATTGTGGCTCATCAGAACCCACCTGGCTTTC 
AACCAACTTGGGTATTTTGACCTGTATAGAATGTTCTGGCATCCATAGGGAAATGGGGGTTCATATTTCT 
CGCATTCAGTCTTTGGAACTAGACAAATTAGGAACTTCTGAACTCTTGCrGGC 

ATAGTTTTAATGATATTATGGAAGCAAATTTACCGAGCCCCTCACCAAAACCCACCCCTTCAAGTGATAT 
GACTGTACGAAAAGAATAT ATCACTGCAAAGTATGTAGATCATAGGTTTT CAAGGAAGAC CTGTTC AACT 
TC^TCAGCTAAACTAAATGAATTGCTTGAGGCCATCAAATCCAGGGATTTACTTGCACTAATTCAAGTCT 
ATGCAGAAGGGGTAGAGCTAATGGAACC^CTGCTXSGAACCTGGGCAGGAGCTTGGGGAGACAGCCCTTC^ 
CCTTGCCGTCCGAACTGC^GATCAGACATCTCTCCATTTGGTTGACTTCCTTGTACAAAACTGTGGGAAC 

CTGGATAAGCAGACGGCCCTGGGAAACAGAGTTCT 

TGAAGCTTTTGCTCAGGAGCAAGCCCACTGTGGATATAGTTAACCAGGCTGGAGAAACTGCCCTAGACAT 

AGCAAAGAGACTAT^AAGCTACCCAGTGTGAAGATCTGCTTTCCCAGGCTAAATCTGGAAAGTTCAATCC^ 

CACGTCCACGTAGAATATGAGTGGAATCTTCGACAGGAGGAGATAGATGAGAGCGATGATGATCTGGATG 

ACAAACCAAGCCCTATCAAGAAAGAGCGCTCACCCAGACCTCAGAGCTTCTGCCACTCCTCCAGG^ 

CCCCCAGGACAAGCTGGCACTGCCAGGATTCAGCACTCCAAGGGACAAACAGCGGCTCTCCTATGGAGCC 

TTC^CCAACCAGATCTTCGTTTCCACAAG^ 

TGCCTCCTAGGAACGCCGGGAAAGGTCC^CTGGCCCACCTTCAACACTCCCTCTAAGCACCCAGACCTC 

TAGTGGCAGCTCCACCCTATCCAAGAAGAGGCCTCCTCCCCCACCACCCGGACACAAGAGAACCCTATCC 

GACCCTCCCAGCCCACTACCTCATGGGCCCCCAAACAAAGGCGCAGTTCCTTGGGGTAACGATGGGGGTC 

CATCCTCTTC^GTAAGACTAC^^CAAGTTTGAGGGACTATCCCAGCAGTCGAGCAC 

GACTGCCCTTGGCCCAAGAGTTCTTCCTAAACTACCTCAGAAAGTGGCACTAAGGAAAACAGATCATCTC 

TCCCTAGACAAAGCCACCATCCCGCCCGAAATCTTTCAGAAATCATCACAGTTGGCAGAGTTGCCACAAA 

AGCCACCACCTGGAGACCTGCCCCCAAAGCCCACAGAACTGG 

GCCTAAGCGAGGAGAACTGCCCCCCAAACCACAGCTGGGGGAC 

TTACCTCCCAAACCACAGATGAAGGACCTGCCCCCCAAA^^ 

AGACTGGAGATGTCTGACCCAAGGCTGAGCAACCCTCTGAGGT 

ATCCCCAAATGTGCAGTCCAGAGACGCCATCCAAAAGCAAGCATCT 

ACTCTGCCAGAGACGCCCGTACCACTGCCCAGAAA71ATCAATACGGGGAAAAATAAAGTGAGGCGAGTGA 

AGACCATTTATGACTGCCAGGCAGACAACGATGACGAGCTCACATTCATCGAGGGAGAAGTGATTATCGT 

CACAGGGGAAGAGGACCAGGAGTGGTGGATTGGCCACATCGAAGGACAGCCTGAAAGGAAGGGGGTCTTT 

CCAGTGTCCTTTGTTCATATCCTGTCTGACTAGCAAAACGCAGAACCTTAAGATTGTCGAC^ 

GCAAGACTGCTGCCTTCATGTAACCCTGGGCACAGTGTGTATATAGCTGCTGTTACAGAGTAAGAAACTC 

ATGGAAGGGCCACCTGAGGAGGGGGATATAATGTGTGTTGTAAATATCCT 

GTATGAGGGTAGCCTCGGACCCGGCGCGCCTTACTGGTTTGCCAAAGCCATCCTTGGCATOTAGCACTTA 
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CATCTCTCWGCTGTTCTACAAGCAAACAAACAAA^^ 
AAAGTG^TCAGT^AAM 

AGOTACCTGAAATCTGCCTTTTATAAAGGAATAGTATGGATAftGTG^^ 

attcccattaaag^agaaattataaggttgcaacaatatttgtttcta 
Sattga^ttgtgttatgaatg^catctctcacbatgtgttc 

actatottcttagttttgttcttggacaaatttaaacttttaaaagattattcaagatgaatttaaaag^ 
caacccttcacacagtttccctactgtatgtagaatccaggtgctgaaaccaagtgtttcttttcccatg 

ctctttgttaaaccccaattatagataatttttccagtc^^ 

aaccaagtttttgaacgctgctatgaattgcactgtgaaaagcactcttccctctcagttttcttttcat 
cccagccatgtttatcagatccttaagaacattgtatttcagtcttttacatcagtctgaattttggaaa 
agaatgcaatagttgtactccacagtcagtggaactgttccctgagtccgaggctcatgtgtcattctgg 

CACTACATTTGCTTAAATTGCTATTTTGGCAACAGCACAGAAAACTAATATTrTTAAGCAGAGAATCTTG 
GCAATGAGTGAGAGATGTTAATTTCACAGAAGCA^CTCCCAACCC^ 

CATCGTTACAGTGCTCAGTGAATATTAATTTAGTTCTGCTTAAGTGGTTGCTATACAAACTTTGAATAGC 
CACCTAATAAATAAACCTTGCATGACAAACCTGCAAAATATTTTATCAGCTGTTATTGGAAAGTGATTTT 
AAGCAATTGCTTCCTCAGTGTCAGGGCACATGTGAATTTCCACACCAAACAGAGCATGAGGAACCAGTTG 
ACATGCTGGGTTGTGACTGGCAGCTTTAGCAGCCTCGGTACTGAAGCCACACCAGTGTCCGGATGGAAGT 
CTGCATCTGAGGTTGCTCAGTGTCCCGGTCATTCATTTACACATTTTAACTTGCATTAAAGAGCTGTTCT 
TTTCTGTGGCCTAGACTCTTTTCACTGATCTCAAAATAAACTGGTTTTTTTCAAAAAAAAAAAAAAAACA 
AAAACAAAAAAAAAACACAAAAGCTGCATGTCTAAAATTACATGGAGTTAGTGTCTATTCTTTTTCCCCT 
TTTGCAGCAACTTACACAGC^TTTTTAACACCTTTTTTTTCTAGTTTTTTTGTTCGGTTTTGTTTTCCAT 
"CAGGAATTTGAGTTCTCTCTAACCCAGCTTACTGTGGGACATAGGAAAACTCAGTAGAAATACCTTTGGT 
GATCTTGTTGAGTTTAAGTCTGATCTTGATCTTAAACTCAGTAAGCCACTATCTGCAATTTTGTACATTA 
TATAGTATTTTGAAGATATGGAACCTTATGAAAAAAAAATAGCAAATTAGTTCTTTTTCCCCCAGAGGGG 
AAAGTTATGTTCTGCAAATAGTGTGTGTCTTATTTTACTGTTGAACAGCAATTGCTATTTATTTTTTTAT 
TGCCTAGAACTTCAACATGTTGTATAGGAATCCTGTAGTGCCACTAGTTAAATGCCGAATTCTCATCTGG 
ATGTTACCATCAAACATCAGTACACTTGTCATTTCACATGTGTTTAATGTGACAGTTTTTCAGTACTGTA 
TGTGTTAATTTCTACTTTTTTTAATATTTAAAATTGCTTTTAAATAAACATATTCTCAGTTGATCCC 

Human DDEF1 protein sequence - varl (public gi: 3187372 8) (SEQ ID NO: 233) 

ETKFTKI EKBKREHAKQHGMIRTBI TGAEIAEEMEKERRIiFQWJMCEYIiIKVNEI KXKKGVDIiIiQNLIKY 
YHAQCMFFQDGDKTADKLKQYIEKLAADLVNIKQTQDEEKKQLTALRDLIKSSLQLDQKEDSQSRQGGYS 
MHQLQGNKEYGSEKKGYLLKKSIX3IRKVWQRRKCSVKNGILTISHATS^QPAKIiNLLTCQVKPNAEDKK 

S FDLI SHNRT YHFQAEDEQDYVAW I SVLTNSKEEALTMAFRGEQS AGENSLEDLTKAI I EDVQRLPGND I 
CCDCGSSEPTWLSTNLGILTCIECSGIHREMGVHISRIQSLEIjDKIjGTSELIiIjAKNVGNNSFNDIMEANIj 
PSPSPKPTPSSDMTVRKEYITAKYVDHRPSRKTCSTSSAKLNELLEAIKSRDLIiAIjIQVYAEGVEIjMEPIj 
LEPGQELGETALHIAVRTAE)QTSLHLVDFIjVQNCGNIiDKQTAIiGNTVIjHYCSMYSKPECIjKIiIjIjRSKPTV 
DIVNQAGETALDIAKRLKATQCEDIiLSQAKSGKFNPHVHVEYEWNLRQEEIDESDDDIiDDKPSPIKKERS 
PRPQSFCHSSSISPQDKIiALPGFSTPRDKQRIjSYGAFTNQIFVSTSTDSPTSPTTEAPPIjPPRNAGKGPT 
GPPSTLPLSTQTSSGSSTLSKKRPPPPPPGHKRTLSDPPSPLPHGPPNiraAVPWGNDGGPSSSSKTTNKF 
EGLSQQSSTSSAKTAI^PRVLPKLPQKVALRKTDHLSLDKATIPPEIFQKSSQUAELPQKPPPGDI.PPKP 
TEIiAPKPQIGDLPPKPGELPPKPQLGDIiPPKPQIiSDLPPKPQMKDIiPPKPQLGDLLAKSQTGDVSPKAQQ 
PSEVTLKSHPLDLSPlWQSRDAIQKQASEDSiroLTPTLPETPVPLPRKINTGKNKVRRVKTIYDCQADND 
DELTFIEGEVIIVTGEEDQEWWIGHIEGQPERKGVFPVSFVHILSD 

Human DDEF1 protein sequence - var2 (public gi: 63308S4) (SEQ ID NO: 234) 

KREHAKQHGMIRTEITGAEIAEEMEKERRDFQLQMCEYI»IKVNEIICrKKGVDIjLQNI»IKYYHAQCNFFQD 
GLKTADKLKQYIEKLAADLYNIKQTQDEEKKQIiTALRDLIKSSLQLDQKESRRDSQSRQGGYSMHQLQGN 
KEYGSEKKGYLIjKKSDGIRKVWQRRKCSVKNGILTISHATSNRQPAKIiNIjLTCQVKPNAEDKKSFDLISH 
NRTYHFQAEDEQDYVAWISVLTNSKEEALTMAFRGEQSAGENSLEDLTKAIIEDVQRLPGNDICCDCGSS 
EPTWIjSTNLGILTCIECSGIHREMGVHISRIQSIiEIiDKIjGTSELIjIiAKIIV'GNNSFNDIMEANIiPSPSPKP 
TPSSDMTVRKEYITAKYVDHRFSRKTCSTSSAKLNELLEAIKSRDLLAI.IQVYAEGVELMEPIjLEPGQEL 

GETALHIAVRTADQTSLHLVDFIiVQNCGNIiDKQTAIiGNTV^ 

ETAIiDIAKRLKATQCEDLI»SQAKSGKFNPHVHVEYEWNIjRQEEIDESDDDLDDKPSPIKKERSPRPQSFC 
HSSSISPQDKLALPGFSTPRDKQRLSYGAFTNQIFVSTSTDSPTSPTTEAPPLPPRNAGKGPTGPPSTLP 



Figure 36 part - 28 



p C T/US04/0630B 



GEVIIVTGEEDQEWWIGHIEGQPERKGVFPVSFVHILSD 

Human DDEF1 protein sequence - var3 (public gi: 7689054) (SEQ ID NO: 235) 

L 

Human DDEF1 protein sequence - var4 (public gi: 180 8 8818) (SEQ ID NO: 236) 

MNAHLSDVLKHPLHPFFIIFLVLFLDKFKLLKDYSR 

Human DDEF1 protein sequence - var5 (Predicted by Proteologics) (SEQ ID NO: 237) 
migqpqeacrshhksikaibqdrt^ 

EHAKQHGMI RTE I TGAE I AEEMEKERRLFQLQMCE YLI KVNE I KT ^^^ LL ^^^^H^^3^F^5!^p 
KTADKLKQYIEKI^LYNIKQTQDEEKKQLTiU^RDLIKSSLQLDQKESRRDSQSRQGGYSMHQWG^ 

YGSEKKGYLLKKSDGIRKVWQRRKCSVKNGILTISHATSNRQPAKLNLLTCQVKPNAEDKKSFDLISHNR 

TYHFQAEDEQDYVAWISVLTNSKEEALTMAFRGEQSAGENSLEDLTKAIIEDVQRLPGNDICCDCGSSEP 

TWLSTNLGILTCIECSGIHREMGVHISRIQSLEnDKLGTSELLIiAKNVG^SFNDIMEANLPSPSPKPTP . 

SSDMTVRKEYITAKYVDHRFSRKTCSTSSAKI^ELt.EAIKSRDLLALIQVYAEGVELMEPLLEPGOELGE 

TALHIAVRTADQTSLHLVDFLVQNCGNI^KQTALGNTVLHYCSMYSKPECLKLLLRSKPTVDIWQAGET 

ALDIAKRLKATQCEDLLSQAKSGKFNPHVHVEYEWNLRQEEIDESDDDLDDKPSPIKKERSPRPQSFCHS 

SSISPQDKLALPGFSTPRDKQRLSYGAFTNQIFVSTSTDSPTSPTTEAPPLPPRNAGKGPTGPPSTLPLS 

TQTSSGSSTLSKKRPPPPPPGHKRTLSDPPSPLPHGPPNKGAVPWGNDGGPSSSSKTTNKFEGLSQQSST 

"sSAKTALGPRVIiPKIiPQKVALRKTDHLSIjDKATIPPEIFQKSSQIjAELPQKPPPGDIiPPKPTEIAPKPQI 

GDLPPKPGELPPKPQLGDLPPKPQLSDLPPKPQMKDLPPKPQLGDLLAKSQTGDVSPKAQQPSEVTLKSH 

PLDLSPWQSRDAIQKQASEDSNDLTPTIjPETPVPLPRKINTGKNKVRRVKTIYDCQADNDDELTFIEGE 

VIIVTGEEDQEWWIGHXEGQPERKGVFPVSFVHILSD 

Human DDEF1 pray sequence - varl (SEQ ID NO: 54) 

GCX3CCGCCATGGTAGTACCCATACGACGTACCAGTATTACGCTCATATGGCCATGGCAGGCCAGTGAATT 
CCACACCAAGCAGTGGTATCAACGCAGAGTGGGCAC^AAAGCCACGCACGCTGGANGACCTGCCCCCAAC 
AGCCCACAGAAACTGGCCCCCAAGCCCCAAATTGGAGATTTGCCGCCTAAGCCAGGAGAACTGCCCCCCA 
AACCACAGCTGGGGGACCTGCCACCCAAACCCCAACACTCAGACTTACCTCCCAAACCACAGATGAAGGA 
CCTGCCCCCCAAACCACAGCTGGGAGACCTGCTAGCAAAATCCCAGACTGGAGATGTCTCACCCAAGGCT 
CAGCAACCCTCTGAGGTCACACTGAAGTCACACCCATTGGATCTATCCCCAAATGTGCAGTCCAGAGACG 
CCATCCAAAAGCAAGCATNTGAAGACTCCAACGACCTCACGCCTACTCTGCCAGAGACGCCCGTACCACT 
GCCCANAAAAATCANTACGGGGAAAANTAAHNTGAGGCGAGTGAAAACCTTTAATGACTGCCAGGCANAC 
ANNATGACAAGCTCNATTCNTCNAGGGANAAGTGTTATCGTNCAGGGAAAAAGNNCNGGATTGTGGGTCC 

NNCAATTTTCNTCCNNTNNTCNNACTTATTANAA™^ 
GA&AANAGGNNTTTNNNCAAGGANCNTNNNNTNGTTTOT 
NCNCNTTTTTNTNNAAAAAOTCMGNANNNNNNN^ 
NNTNNNGGGGGGNNNANACCCCCCC 

Uniqene Name ; BIF3S3 Uniqene ID : Hs. 58189 Clone ID : 3GD_18 
Human E1F3S3 mRNA sequence - varl (public gi: 23 5 13 7 9) (SEQ ID NO: 55) 

GAAAGATGGCGTCCCGCAAGGAAGGTACCGGCTCTACTGCCACCTCTTCCAGCTCCACCGCCGGCGCAGC 
AGGGAAAGGCAAAGGCAAAGGCGGCTCGGGAGATTCAGCCGTGAAGCAAGTGCAGATAGATGGCCTTGTG 
GTATTAAAGATAATCAAACATTATCAAGAAGAAGGACAAGGAACTGAAGTTGTTCAAGGAGTGCTTTTGG 
GTCTGGTTGTAGAAGATCGGCTTGAAATTACCAACTGCTTTCCTTTCCCTCAGCACACAGAGGATGATGC 
TGACTTTGATGAAGTCCAATATCAGATGGAAATGATGCGGAGCCTTCGCCATGTAAACATTGATCATCTT 
CACGTGGGCTGGTATCAGTCCACATACTATGGCTCATTCGTTACCCGGGCACTCCTGGACTCTCAGTTTA 
GTTACCAGCATGCCATTGAAGAATCTGTCGTTCTCATTTATGATCCCATAAAAACTGCCCAAGGATCTCT 
CTCACTAAAGGCATACAGACTGACTCCTAAACTGATGGAAGTTTGTAAAGAAAAGGATTTTTCCCCTGAA 
GCATTGAAAAAAGGAAATATCACCTTTGAGTACATGTTTGAAGAAGTGCCGATTGTAATTAAAAATTCAC 
ATCTGATCAATGTCCTAATGTGGGAACTTGAAAAGAAGTCAGCTGTTGCAGATAAACATGAATTGCTCAG 
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CCTTGCCAGCAGCAATCATTTGGGGAAGAATCTAi 



CAGTTGCTGATGGACAGAGTGGATGAAATGAGCCAA 

===== 

^aarTOTTGGAAGAAATATATTTGCATATTGAAAAGC^CAGAGGATTTCTTTAGTGTCATTGCCGAa i J. lb 
AAAAAAAAAAAAAAAAAAAA 

Human EIF3 S3 mRN A sequence - var2 (public gi: 21751901) (SEQ ID NO: 56) 
Iagaa^gaaat^ 

T^GCACG^CACGAR.GGGCA.CGTAGTTTGTTAGTGTCTGGAACCCTl 

GTTCTTGCTACACCACCATAGAAACTAACGTCACTCAGGAACCATTTGTCAGGGCAAAGGGTGCCAC^T 

g^cgSgcctagagtcagaattcctaaccttgagtcctggttttgccacaa^ 

CTTCGGCAACTCCCGAAGAAAGCTGGGTTCAACTTCCTCACTGTCAAACTGGTTGTAGGTCTAG 
SaAGTACTCTT^^ 

cacaaagcI^^ 

TCftAGTTCTCCCGTGGCTAACTGTCGAGT^ 

?acaSagg^ . 

CTGCCACCTCTTCCAGCTCCACCGCCGGCGCAGCAGGGAAAGGCAAAGGCAAAGGCGGCTCGGGA 

•agccgtgaagcaagtgcagatagatggccttgtggtattaaagataatcaaacattatcaagaagaagga 
Saaggaactgaagttgttcaaggagtgcttttgggtctggttgtagaagatccmctt 
^^StcccSagcacacagaggatc^tgctgactttgatgaagtccaatat 
gcggagccttcgccatgtaaacattgatcatcttcacgtgggctggtatcagtccacatactatggot 
ScgttacccggGcactcctggactctcagtttagttaccagcatgccattgaagaatctgtcgttctc^ 

™£ga1£ccataaa^^ 
StcLgaagtgccgattgtaattaaaaattcaca^ 

agtcagctgttgcagataaacatgaattgctcagccttgccagcagcaatcatttggggaag 

gttgctgatggacagagtggatgaaatgagccaagatatagttaaatacaacacatacatgag^ 

agtaaacaacagcagcagaaacatcagtatcagcagcgtcgccagcaggagaatatgcagcgccagagcc 

gagga£aa1£^ 

ggactcgctgctcattgcaggccagataaacacttactgccagaac^tcaaggagttcactgcccaaaac 
ttaggcaagctcttcatggcccaggctcttcaagaatacaacaactaagaaaaggaagtttccagaaaag 
aagttaacatgaactcttgaagtcacaccagggcaactcttggaagaaatatatttgcatattgaaaagc 

ACAGAGGATTTCTTTAGTGTCATTGCCGATTTTGGCTATAACAGTGTCTTTCTAGCGATAATAAAATAAA 
ACAAAATCTTG 

Human EIF3S3 mKNA sequence - var3 (public gi: 12653234) (SEQ ID NO: 57) 

GGCACGAGGATGGCGTCCCGCAAGG7VAGGTACCGGCTCTACTGCCACCTCTTCCAGCTCCACCGCCGGCG 
CAGCAGGGAAAGGCAAAGGCAAAGGCGGCTCGGGAGATTCAGCCGTGAAGCAAGTGCAGATAGATGGCCT 

tgtggtattaaagataatcaaacattatcaagaagaaggacaaggaactgaagttgttcaaggagtgctt 

TTGGGTCTGGTTGTAGAAGATCGGCTTGAAATTACCAACTGCTTTCCTTTCCCTCAGCACACAGAGGATG 
ATGCTGACTTTGATGAAGTCCAATATCAGATGGAAATGATGGGGAGCCTTCGCCATGTAAACATTGATCA 
TCTTCACGTGGGCTGGTATCAGTCCACATACTATGGCTCATTCGTTAGCCGGGCACTCCTGGACTCTCAG 

tttagttaccagcatgccattgaagaatctgtcgttctcatttatgatcccataaaaactgcccaaggat 
ctctctc^ctaaaggcatac^gactgactcctaaactgatggaagtttgtaaagaaaaggatttttcccc 
tgaagcattgaaaaaagcaaatatcacctttgagtacatgtttgaagaagtgccgattgtaattaaaaat 
tcacatctgatcaatgtcctaatgtgggaacttgaaaagaagtcagctgttgcagataaacatgaattgc 
tcagccttgccagcagcaatcatttggggaagaatctacagttgctgatggacagagtggatgaaatgag 
cc^gatatagttaaatacaacacatacatgaggaatactagtaaacaacagcagcagaaacatcagtat 
cagcagcgtcgccagcaggagaatatgcagcgccagagccgaggagaacccccgctccctgaggaggacc 
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(TCCAAACTCTTCAAACCACCACAGCCGCCTGCGAGGATGGACTCGCTGCTCATTGCAGGCCAGATAAA 

CCCAAAA.CT 

cagaaaag; 

TGAAAAGCi — 

TTTGGCTATAACAGTGTCTTTCTAGCCATAATAAAATAAAACAAAATCTTGAAAAAAAAAAAAAAAAAAA 



GGGCAACTCTrGGAAGAAATATATTTGCATATTGAAAAGCACAGAGGATTTCTTTAGTGTC^T^^^^ _ _ 



AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human EEP3S3 protein sequence - varl (public gi: 12653235) (SEQ ID NO: 238) 

R^^QRQSRGEPPLPEED L SKLFKPPQPPARMDSI.LIAGQINTYCQNIKEFTAQNLGKI,FMAQALQEy , 
NN 

Uniqene Name: .EPS8L2 Unicrene ID: Hs. 55016 

Human EPS8L2 mRJNA sequence - varl (public gi: 2i2646is) (SEQ ID NO: 58) 
Human bi 3CG ^ C cccgcaacctgtcgctcaggttcctcctctcc^ccccgcc 

rC CCACCCGCCCGCGGGAGACCTGGCGCCCCGGCCGAGGCGCGAACAGAC 



GTCGACGGCCATTACC^TCGCGAAACCCCGCAA^^ 

?ScccggcccScgagStcccacccgcccgcgggagacctggcgccccggccgag 

ScGCACCGGOGAG^ 

gg^cagaccgg^^ 

g?cc?g5gccgtgagctgctgccc^ 

mgagccccS 

SgSotaccacg^cagcacctggccacattcatcatggacaag 
cg^tccSaaS 

AGAAGATTCGGCAGCGGCAGTCCATCCTGCCTCCTCCCCAGGGCCCGGCGCCCAT^ 

ac?g?gc?ctgg!cgacItcgagtggtttgtggcccgg^ 

gaaccmcggaaaaaggggaagaagaagggcaagaaggcgcc^ 

cggcccccctctgagggcgagttcatcgactgcttccagaaaatc^ 

AGCTGCAGAAGCACATCCAGAACCCCAGCGCCGCGGAGCTCGTGCAOTTCCTCTTCGGG^ 

gatcgtcaacacctgcagtggccc^^ 

gtogaStccS^ 

^g^ggSccgtcccgagtg^ 

ggSg?ctcc?g?ggatgtgSgcaggaggccccctgggaggtg^ 

gaIStSgtccag^ 
?£c?gggggatocS 
gScaagSSS^ 

ACGTGCCCTGCAACATCCTAGGCGAGGCGCGACCGGAGGA^ 
GAAGTACTGGGGCCCCGCC^GCCCGACCCACAAGCTACC 

ATGCAGCACATGGACGAGGTCAAC(3ACGAGCTCATCGGGAAAATCAGCAACATCAGGG^ 
G^CACTTCCGCGTGGAGCGCAGCCAGCCCGTGAGGCAGCCGCTCACCTA^ 

CCGCGCCTGGCTGGAAGCCAAGGCCTT(^GCCCGCGGATCGTGGAGAACCTGGGCATCCTGACCGGGCOT 

CAGC^TCTC^ 
AG^ACCATGCAGAAGGCCT^^ 

GTTTCATTCCATGAATCAGAGGAGGGGGGAGGACAGCTAGGCCCAGCTGCCTO 
^GGAAGCCCACCCACAATGCATGGAGTATTATTTTTATATGTGTATGTATT^GTAT^^ 

GGGGGTGTGGTGCTGGCTAGAGGTCCCTGCCCCTGTCTGGAGGCACAACGC^ 
.TACCCAAGGCCTCAGCCCACACCAAGACTAATCTCAGCCAAACCTGCTGC^ 
TCCACCTTCTCTTGAGGCCACAGAACTCCCTGGGGCTGGGGCCTCTTTCTCTC 
GGGGGGTCCTGGCCCCTGTGATGCTCCCCCATCCC^ 

GCTGGAGCTCCAGGCTGGCCAGGCTGAACCTCGCACACACGCAGAGTTCTGCTC^ 

GAGGGGCTCCAGCAGGAGGCCGTGGGTGCCATTCGGGGGAAAGTGGGGGAACGACACACACTTCACCIUU 
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AAGGGCCGACAACGCAGGGGACACCGTGCCGGCTTCAGACACTCCCAGCGCCCACTCTTACAGGCCCA^ 

GGTGCCTGGACTTGAGTCTCACCCTAC^ 
GTTAAGTCTCCCAGGAAAAAAAAAAAAAAAA 

Human EPS8L2 protein sequence - varl (pubUcgi: 2i2646i6)(SEQIDNO:239) 

MSQSGAVSCCPGATNGSIX3RSDOTAKMSPKDLFEQ 
VDDAIRKLVQLSSKEKIWTQEMLLQVNDQSIj^ 

ODSEQSKPDVHFFHCTEVEAEIjVHEDIESAIiADCRIjGKKMRPQTLKGH 

qhrggdspeaknrvgpqvplsepgfrrresqe^^ 

KQJjNQRKKGKKKGKKAPAEGVLTIiRARPPSEGEFIDC 

i^livntcsgpdiarsvscpllsrdavdflrghlvpkemslweslgeswmrprsewprep 
sgweppvdvlqeapwevegi^apieevspvsrqsirnsqkhsptseptppgdalppvssphthrgyqpt 

pamakyvkilydftai^anelsvlkdevlevletcrqwwklrsrsgq 

AGQKYWGPASPTHKLPPSFPGNKDEI^QHMDEVNDELIRKISNIRAQPQRHFRVERSQPVSQPIiTYESGP 

DEVRAWI£AKAFSPRIVENIX3ILTGPQLFSI^^ 

MNKFHS MNQRRGEDS 

Human EPS8L2 pray sequence - varl (SEQ ID NO: 59) 

TCNTNCGCCGCCATGGNAGTACCCATACGACGTACCAGNATTACGCTCATATGGCCATGGNAGGCCAGTG 
AATTCCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGCGGGGGG71ACAAAGACGAGCTCATGC 

AGCACATGGACGAGGTCAACGACGAGCTCATCCXK3AAAATCA 

CTTCCGCGTGGAGCGCAGCCAGCCCGTGAGCCAGCCGCTCACCTACGAGTCGGGTCCGGACGAGGTCCGC 
GCCTGGCTGGAAGCCAAGGCCTTCAGCCCGCGGATCGTGGAGAACCTGGGCATCCTGACGGGGCCGCAGC 
TCTTCTCCCTCAACAAGGAGGAGCTGAAGAAAGTGTGCGGCGAGGAGGGCGTCCGCGTGTACAGCCAGCT 
CACCATGCAGAAGGCCTTCCTGGAGAAGCAGCAAAGTGGGTCGGAGCTGGAAGAACTCATGAACAAGTTT 
*CATTCCATGAATCAGAGGAGGGGGGAGGAGAGCTAGGCCCAGCTGCCTTGGGCTGGGGCCTGCGGAGGGG 
AAGCCCACCCAGAATGCATGGAGTATTATTTTTATATGTGTATGTATTTTGTATCAAGGACACGGAGGGG 
GGTGTGGTGCTTGGCTANAGGTCCCTGCCCCTGTTTGGNAGGCACAACNCCCATNCTTTAGNCCAAANAG 

TNACCCAANGGCCTNAACCCCAANCAAGNTTATTTTNANNC 

CCNTTGTTGGTGCCNITOCCCNTTGTNCANOT 

CNTTTTTTNTNN 

Human EPS8L2 pray sequence - var2 (SEQ ID NO: 60) 

CGAGCXSCCGCCTGGNATACCCATACGACGTACCAGNATTACGCTCATATGGCC^TGGNAGGCCAGTGAAT 
TCCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGCGGGGGGAACA^GACGAGCTCATGCAGC 
ACATGGACGAGGTCAACGACGAGCTCATCCGGAAAATCAGCAACATCAGGGCGC^ 

CCGCGTGGAGCGCAGCCAGCCCGTGAGCCAGCCGCTCACCTACGAGTCGGGTCCGGACGAGGTCCGCGCC 
TGGCTGGAAGCCAAGGCCTTCAGCCCGCGGATCGTGGAGAACCTGGGCATCCTGACCGGGCCGCAGCTCT 
TCTCCCTCAACAAGGAGGAGCTGAAGAAAGTGTGCGGCGAGGAGGGCGTCCGCGTGTACAGCCAGCTCAC 
CATGCAGAAGGCCTTCCTGGAGAAGCAGCAAAGTGGGTCGGAGCTGGAAGAACTCATGAACAAGTTTCAT 
TCCATGAATCAGAGGAGGGGGGAGGACAGCTAGGCCCAGCTGCCTTGGGCTGGGGCCTGCGGAGGGGAAG 
CCCACCCACAATGCATGGAGTATTATTTTTATATGTGTATGTATTTTGTATCAAGGACACGGAGGGGGTG 
TGGTGCTGGCTANAGTTCCCTGCCCTTGTOTGGAGGCACAC^CCCATCCTTAGGCCAAACANTACCNAGG 

NCTNANCCG&CACCAANACTATTTTAACCNAA 

CCNTTOTCCCNTTTTTTTGNGNCCNAAAATTCNTGGGGCTGGGC^^ 

TNGGGGTTCTGGNCCNTNNNNTNTNCCCCTNCC^ 

Human GOCAP1 mRNA sequence - varl (public gi: 10438O60) (SEQ ID NO: 61) 

GATACGTGGCTGCCGTCTGTCCCC^CTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTGAACGCAG 
AGCGACTCGAGGTGTCCGTCGACGGCCTCACGCTCAGCCCGGACCCGGAGGAGCGGCCTGGGGCGGAGGG 
CGCCCCGCTGCTGCCGCCACCGCTGCCACCGCCCTCGCCACCrGGATCCGGTCGCGGCCCGGGCXSCCTCA 
GGGGAGCAGCCCGAGCCCGGGGAGGCGGCGGOTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGC 
GCTGGGGTTTCGGCCTGGAGGAGTTGTACGGCCTGG 

ATTT<^TCC7U^CTTATGAAGAAA^TTGAAGCTTGTGGCACTGC^TAAGC^GTTCTTATGGGCCCATAT 

AAT CCAGACACTTGTCCTGAGGTTGGATT CTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAG 

CCCTGGGAAACATGTCTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTTGCCATCT 

CTTTTCAACATATGTTGCGTCCCACAAAATAGAGAAGGAAGAGCAAGACAAAAAAAGGAAGGAGGAA 

GAGCGAAGGCGGCGTGAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGGAGAGAAG 

AAGAGGAAAGGCT^CXSACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCA 
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GCAAAAGCAGCAGATAATGGCAGCTTTAAACT^^ 
CGATGACGACGAGGAGGAAGAAGAAAACM^ 

ttgctggatgagattgtgcctgtgtaccgac^ 
atccagggagaggagtctatctcctcaagtttgacaactc^ 

ctacagagtctattatactagata^ - 

(^TTTAAITTGGAAATTTCTTTTTACTTTTGTGGAGCATTAGA^ 
CTGATGGTTTGTGAACTCTTGCTGGGAATCAAAATTTCCTTGAGA 

TAAAGGAGATTCCTCAGACTC^TCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGTGGATGCTGAAGT 

TAGATGAGCTACATGTTAAATATTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTTACCCGGCTG 

ATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCTAATTTGA 

AATCCATAAGTTAACAAGTCTATATCAGGTGCAGCTGGCTTTGATTAAAGGCCATTT^ 

ACTCAACACCTCACAGATTATAAAAAAAAAAAAAAAAAAA 

Human GOCAP1 mRNA sequence - var2 (public gi: is 826 85 1) (SEQ ID NO: 62) 

GGAAGTCGATACGTGGCTGCCTTCTGTCCCCGCTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTG 
AACGCAGAGCGACTCGAGGTGTCCGTCGACGGCCTCACGCTCAGCCCGGACCCGGAGGAGCGGCCTGGGG 
CGGAGGGCGCCCCGCTGCTGCCGCCACCGCTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGG 
CGCCTCAGGGGAGCAGCCCGAGCCCGGGGAGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTG 
GAGCAGCGCTGGGGTTTCGGCCTGGAGGAGTTGTACGGCCTGGCACTGCGCTTCTTCAAAGAAAAAGATG 

GCAAAGCATTTCATCCAACTTATGAAGAAAAATTGAAGCTTGTGGC^ 
' CCCATATAATCCAGACACTTGTCCTGAGGTTGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAA 
TGGGCAGCCCTGGGAAACATGTCTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTT 
GCCATCTCTTTTCAACATATGTTGCGTCCCACAAAATAGAGAAGGAAGAGCAAGAAAAAAA^GGAAGGA 
GGAAGAGGAGCGAAGGCGGCGTGAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGG 
AGAGAAGAAGAGGAAAGGCTTCGACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGT 
TGGAGCAGCAAAAGCAGCAGATAATGGCAGCTTTAAACTCCCA 
AGCCCAACAGTATCCAGGGAACTACGAACAGmGCA^ 
CAGCAGTACATGCAGCAJGTTGTATCAAGTCCAGCTTGCA 

AAGTAGTAGTGGCTGGGTCTTCCTTGCCTACATCATCAAAAGTGAATGCAACTGTACCAAGTAATATGAT 
GTCAGTTAATGGACAGGCG?VAAACACACACTGACAGCTCCGAAAAAGAACTGG 

GAAGCCCTGGAGAATGGACCAAAAGAATCTCTTC^GTAATAGCAGCTCCATCCATGTGGACACGACCTC 
AGATCAAAGACTTCAAAGAGAAGATTCAGCAGGATGCAGATTCCGTGATTACAGTGGGCCGAGGAGAAGT 
GGTCACTGTT CG AGTACCCACCCATGAAGAAGGAT CATATCT CTTTTGGGAATTTGCCACAGACAATTAT 
GACATTGGGTTTGGGGTGTATTTTGAATGGACAGACTCTCCAAACACTGCTGTCAGCGTGCATGTCAGTG 
AGTCCAGCGATGACGACGAGGAGGAAGAAGAAAACATCGGTTGTGAAGAGAAAGCCAAAAAGAATGCCAA 
CAAGCCTTTGCTGGATGAGATTGTGCCTGTGTACCGACGGGACTGTCATGAGGAGGTGTATGCTGGCAGC 
CATCAATATCCAGGGAGAGGAGTCTATCTCCTCAAGTTTGACAAC^ 

CAGTCTACTACAGAGTCTATTATACTAGATAAAAATGTTGTTACAAAGTCTGGAGTCTAGGGTTGGGCAG 

AAGATGACATTTAATTTGGAAATTTCTTTTTACTTTTGTGGAGCATTAGA 

TATTGGTCTGATGGTTTGTGAACTCTTGCTGGGAATCAAAATTTCCT 

TTGGGGTTAAAGGAGATTCCTCAGACTCATCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGTGGATG 

CTGAAGTTACATGAGCTACATGTTAAATATTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTTAC 

CCGGCTGATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCT 

AATTTGAAATCCATAAGTTAACAAGTCTATATCAGGTGCATCTGGCTTTGATTAAAGGCCATTTTTAAAA 

CTTAAAAACTCAACACCTCACAGATTATAATAGAAAAAGAAATGGCOT 

ACCCAGATTGTTTCTGCTTTGGGTGCAGCTGTTT^ 

ATAATCTTAAACTAGAATGTTCAAAAOTAATTGATAATTGAAGTATCAAGATACGTAGAACACCTCAGAG 

ATTTTTCTTCAGGAACTTCCACAAACTTTGAATCCTTGTATCTTTATTTGGTATTCATACT 

AAAATACAGGTTTTTTGTTTTGTTTTG1TTTGGCTTCATAGAGTATC 

AAGAAT AAAATTAAGGATTTTATAAACTCAAATTGG CACCTACTGAATTAAAATACATAAAAT CATTTAA 
ATATAATTCAGCATATGGGAAGTAACATTGCACT 

TTTAATTTGTTACTACTTAGTCACAAACCCCACATTATTCCAGTTTGGAATTACTTA 
GAAATACATATGCCCATGCTTAAATTTTATAGCTTTAATTTGTGTTATTTCTTTATTG 
ACATCTTTTTTTCCTTACTGAAAACCAAATATGGATTAATTGCCTCAAATTTGTATAAA 
GTGATTCTTGTTTTCAGGAAGGGAGAGTGGTATAGATAGAAAATGACAAAGATGGC^TATACACTTAAT 
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GTTGTTATTGTATGTTGTTAC^ 

tgca^mottaa^^tgaattatctcctaaaaatatcat^ 

arrarrrrTTCCTGTCATGTGTTCATGTTCA^ 

£gSga??ag?a^ 

GTTTATCTGTAAAAAAAAAAAAAAAAAA 

Human GOCAP1 mRNA sequence - var3 (public gi: 1579925s) (SEQ m NO: 63) 

AACGCAGAGCGACTCGAGGTGTCCGTCGACGGCCTCACGCTCAGCCCGGA^ 

CGGAGGGCGCCCCGCTGCTGCCGCCACCGCTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGG 

CGC^GGG^GCAGCCCGAGCCCGGGGAGGCGGCGGCTGGGGGC^ 
GAGC^GCGCTGGGGTTTCGGCCTGGAGGAGTTGTACGGCCTGGCACTGC 

GCAAAGCATTTCATCCAACTTATGAAGAAAAATTGAAGCTTGTGGCACTGCATAAGC^GTTC 

cccatStaatccagacacttgtcctgaggttggattc 

TGGGCAGCCCTGGGAAACATGTCTAAAGAGGATGCCATGGTGGAGTTTGTCA 
GCCATCTCTTTTCAACATATGTTGCGTCCCAO^ 

ggaagaggagcgaaggcggcgtgaagaggaagaaagagaacgtctgcaaaaggaggaagag 
agagaagaagaggaaaggcttcgacgggaggaagaggaaaggagacggatagaagaagaaaggcttcggt 

^agI^gcaaaag^gcagat^^^ 
agcccaacagtatcc^gggaactacgaacagcagc^^ 

cag<^gtacatgcagc^gttgtatcaagtccagcttgcacagcaacaggcagcattacagaaacaacagg 
aagtagtagtggctgggtcotccttgcctaca^ 

gtcagttaatggacaggccaaaacacacactgac^gctccgaaaaagaactggaaccagaagctgcagaa 
gaagccctggagaatggacc^^gaatctcttccagtaatagcagctccatccatgtggacacgacctc 
agatcaaagacttcaaagagaagattcagcaggatgcagattccgtgattacagtgggccgaggagaagt 
ggt^ctgttcgagtacccacccatgaagaaggatcatatctcttttgggaatttgccacagacaattat 
gacattgggtttggggtgtattttgaatggacagactctccaaacactgctgtcagcgtgcatgtcagtg 
agtccagcgatgacgacgaggaggaagaagaaaacatcggttgtgaagagaaagccaaaaagaatgccaa 
caagcctttgctggatgagattgtgcctgtgtaccgacgggactgtcatgaggaggtgtatgctggcagc 

CATCAATATCCAGGGAGAGGAGTCTATCTCCTCAAGTTTGACy^CTCCTACrCTTTGTGGCGGTCAAAAT 

CAGTCTACTACAGAGTCTATTATACTAGATAAAAATGTTGTTAC^GTCTGGAGTCTAGGGTTGGGCAG 

AAGATGACATTTAATTTGGAAATTTCTTTTTACTTTTGTGGAGCATTAGAGTCACAGTTTACCTTATTGA 

TATTGGTCTGATGGTTTGTGAACTCTTGCTGGGAATCAAAATTTCCTTGAGACTCTTTAGCATTCATACT 

TTGGGGTTAAAGGAGATTCCTCAGACTCATCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGTGGATG 

CTGAAGTTACATGAGCTACATGTTAAATATTTAAAGTCTCCABAATAAAACACCCCAACGTTGACCTTAC 

CCGGCTGATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCT 

AATTTGAAATCCATAAGTTAACAAGTCTATATCAGGTGCATCTGGCTTTGATTAAAGGCCATTTTTAAAA 

CTTAAAAACTCAACACCTCACAGATTATAATAGAAAAAGAAATGGCCTCAGTTTGATCTCGTTCAGAATG 

ACCCAGATTGTTTCTGCTTTGGGTGCAGCTGTTTAGTTCAGAGTTATATTACAGAGAATTATTTTCTGAG 

ATAATCTTAAACTAGAATGTTCAAAACTAATTGATAATTGAAGTATC^GATACGTAGAACACCTCAGAG 

ATTTTTCTrCAGGAACTTCCACAAACTTTGAATCCTTGTATCTTTATTTGGTATTCATACTACTAGTAGC 

AAAATACAGGTTTTTTGTTTTGTTTTGTTTTGGCTTCATAGAGTATCTCAAATTGAAACTTTTCTGCACA 

AAGAATAAAATTAAGGATTTTATAAACTCAAATTGGCACCTACTGAATTAAAATA<^TAAAATCATTTAA 

ATATAATTCAGCATATGGGAAGTAACATTGCACTAATATGGAAATCACTGCCAGAGACAGTCTATTTTCT 

TTTAATTTGTTACTACTTAGTCACAAACCC(^CATTATTCCAGTTTGGAATTACTTATTAAGGAGAATTG 

GAAATACATATGCCCATGCTTAAATTTTATAGCTTTAATTTGTGTTATTTCTTTATTGACGGGAAGAGGT 

ACATCTTTTTTTCCTTACTGAAAACCAAATATGGATTAATTGCCTCAAATTTGTATAAAGTGATTGGCTA 

GTGATTCTTGTTTTCAGGAAGGGAGAGTGGTATAGATAGAAAATGACAAAGATGGCAATATACACTTAAT 

GTTGTTATTGTATGTTGTTACTGAAGTACTTAGATTTTTAAAATTTCAAATCCTAAATCACTTCTTGTAG 

GAGGGTTTTC^TTAACTGCAGTATATAC^GTTCACTACATATGGGTrGTTTGAGTTTTTTGTGTGCTGTA 

TTTCTTTCTGTTTTTTTAATACCTGGTTTTGTACATATCTAACTCTGTTCTCTTTTGGTTGTTCAGAAAC 

TGGATTTTTTTTTTTTCTTAAGCAGTGCTTAATTTGTGTTTTTTAATTTTGATTCAGAAGTAGTCCCAGC 

TCATAGGTGTTCATAACTGTTACATCCAGAACATTTGTCAGGCTCTCTGTCAGCTTTTCATGTACATATG 

GTATAGAAACCATGGAGTTAGGCACTTCCCTGGAATTTTTTTTTTTATGAGAAAAATACTGTATTTAAAA 

TGTAAAATAAACTTTTAAAAAGCAGGCACTAATATATATTTCTTCCAGCCTTTGATTACAAATTTGTCCT 

TGCACATGTTAAGATGAATTATCTCCTAAAAATATCATTGTTCTTGGGAGCAGTGTATGTTACTTTACAT 

AGCAGCGGTTCCTGTCATGTGTTCATGTTCAGAAATATTTTTGGTTTTAAACTTTCTTATTGCCTTTGGC 



Figure 36 part - 34 



p c t/ u s o »+./ O 6 3 o e 



TGTTGATTAGTACAGTACAAGTTGCGATTTCAAAAAGATCTT^ 
GTTTATCTGTAAAAAAAAAAAAAAAAAA 

Human GOCAP1 mRNA sequence - var4 (public gi: 21961496) (SEQ ID NO: 64) 

Gt3GAGGOGT<K3GTGCCATCTCT 
MAAAAGGAAGC^GGAAGAGGAGCGAAGGC^ 

AGAGAAACGTAGGAGAGAAGAAGAGGAAAGG ctt cgacgggaggaagaggaaaggagacggatagaagaa 

GAAAGC^TO 

TCCAGCAGTATGCAGCCCAACAGTATCCAGGGAACTACGAA^ 
GGAGCAACACTATCAGCAGTACATGCAGCAGTTGTATCAAGTCCAGCTTGCACA 

CAGAAACAACAGGAAGTAGTAGTGGCTGGGTCTTCCTTGCC 

CAAGTAATATGATGTCAGTTAATGGACAGGCCAAAACAGACACTGACAGCTCCGAAA 
AGAAGCTGCAGAAGAAGCCCTGGAGAATGGACCAAAAGAATCTCTTCCAGTAATAGCAGCTCCATCCATG 
TGGACACGACCTCAGATCAAAGACTTCAAAGAGAAGATTCAGCAGGATGCAGATTCCGTGATTACAGTGG 
GCCGAGGAGAAGTGGTCACTGTTCGAGTACCCACCCATGAAGAAGGATCATATCTCTT1TGGGAATTTGC 
CACAGACAATTATGACATTGGGTTTGGGGTGTATTTTGAATGGACAGACTCTCCAAACACTGCTGTCAGC 
GTGCATGTCAGTGAGTCCAGCGATGACGACGAGGAGGAAGAAGAAAACATCGGTTGTGAAGAGAAAGCCA 
AAAAGAATGCCAACAAGCCTTTGCTGGATGAGATTGTGCCTGTGTACCGACGGGACTGTCATGAGGAGGT 
GTATGCTGGCAGCCATCAATATCCAGGGAGAGGAGTCTATCTCCTCAAGTTTGACAACTCCTACTCTTTG 
TGGCGGTCAAAATCAGTCTACTACAGAGTCTATTATACTAGATAAAAATGTTGTTACAAAGTCTGGAGTC 
TAGGGTTGGGGAGAAGATGACATTTAATTTGGAAATTTCTTTTTACTTTTGTGGAGCATTAGAGTCACAG 
TTTACCTTATTGATATTGGTCTGATGGTTTGTGAACTCTTGCTGGGAATCAAAATTTCCTTGAGACTCTT 
TAGCATTCATACTTTGGGGTTAAAGGAGATTCCTCAGACTCATCCAGCCCTTGGGTGCTGACCAGCAGAG 
TCACTAGTGGATGCTGAAGTTAC^TGAGCTACATGTTAAATATTTAAAGTCTCCAAAATAAAACACCCCA 
ACGTTGACCTTACCCGGCTGATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGT 
TACAGTGTCCTCTT^TTTGAAATCCATAAGTTAACAAGTCTATATCAGGTGCAGCTGGCTTTGATTAAAG 
jGCC ATTTTT AAAACTTAAAAACT CAACAC CTCACAGATTATAATAGAAAAAGAAATGGCC TCAGTTTGAT 
CTCGTTCAGAATGACCCAGATTGTTTCTGCTTTGGGTGCAGCTGTTTA 
ATTATTTTCTGAGATAATCTTAAACTAGAATGTTC^AAACT 

GAACACCTCAGAGATTTTTCTTGAGGAACTTCCACAAACTTTGAATCCTTGTATCTTTATTTGGTATTCA 
TACTACTAGTAGCAAAATACAGGTTTTTTGTTTTGTTTTGTTTTGTTTTGGCTTCA 

TTGAAACTTTTCTGCACAAAGAATAAAATTAAGGATTTTATAAACTCAAATTGGCACCTACTGAATTAAA 
ATACATAAAATCATTTAAATATAATTCAGC^TATGGGAAGTAACATTGCACTAATATGGAAAT 
AGAGACAGTCTATTTTCTTTTAATTTGTTACTACTTAGTCACAAACCCC^CATTATT 
ACTTATTAAGGAGAATTGGAAATACATATGCCGATGCTTAAATTTTATAGCTTTAATTTGTGTTATTTGT 
TTATTGACGGGAAGAGGTACATCTTTTTTTCCTTACTGAAAACAAATATGGATTAATTGCCTCAAATTTG 
TATAAGTGATTGGCTAGTGATTCTTGTTTTCAGAAGGGAGAGTGGTATAGATAGAAAATGACAAAGATGG 
CAATATACACrTAATGTTGTTATTGTATGTTGTTACTGAAGTACTTAGATTTTTAAAATTTCAAATCCTA 
AATCACTTCTTGTAGGAGGGTTTTCATTAACTGCAGTATATACAGTTCACTACATATGGGTTGTTTGAGT 
I TTTTTGTGTGCTGTATTTCTTTCTGTTTTTTAATACCTGGTTTTGTACATATCTAACTCTGTTCTCTTTT 
GGTTGTTCAGAAACTGGATTTTTTTTTTTCTTAAGCAGTGCTTAATTTGTGTTTTTTAATTTTGATTCAG 
AAGTAGTCCCAGCTCATAGGTGTTCATACTGTTACATCCAGAACATTTGTCAGGCTCTCTGTCAGCTTTC 
ATGTAGATATGGTATAGAAACCATGGAGTTAGGCACTTCCTGGATCT 

ATTTAAAATGTAAAATAAACTTTTAAAAAGCAGGC^CTAATATATATTTCTTCCAGCCTTTGATTACAAA 
TTTGTCCTTGCAC^TGTTAAGATGAATTATCTCCTAAAAATATCATTGTTCTTGGGAGCAGTGTATGTTA 
CTTTACATAGCAGCGGTTCCTGTCATGTGTTCATGTCAGAGTATTTTTGGTTTTAAACTTTCTTATO 
TTTGGCTGTTGATTAGTACAGTACAAGTGCGATTTCAAAAAGATCTTGAAAGTAATATATTTAATCAATT 

AAAATGTTTATCTGTAAAAAAAAAAAAAAAAAAAAAAAA 

HiunaaGOCAPl mRNA sequence - var5 (public gi: 24495472) (SEQ ID NO: 65) 

CCGCTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTGAACGCAGAGCGACTCGAGGTGTCCGTCGA 
CGGCCTCACGCTCAGCCCGGACCCGGAGGAGCGGCCTGGGGCGGAGGGCGCCCCGCTGCTGCCGCCACCG 
CTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGGCGCCTCAGGGGAGCAGCCCGAGCCCGGGG 
AGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGCGCTGGGGTTTCGGCCTGGAGGA 
GTTGTACGGCCTGGCACTGCGCCTCTTCAAAGAAAAAGATGGCAAAGC^^ 

AAATTGAAGCTTGTGGCACTGCATAAGCAAGTTCTTATGGGCCCATATAATCCAGACACTTGTCCTGATO 
TTGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAGCCCTGGGAAACATGTCTAAAGA 

GGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTTGC^ 

CACAAAATAGAGAAGGAAGAGCAAGACAAAAAAA.GGAAGGAGGAAGAGGAGCGAAGGCGGCGTGAAGAGG 
AAGAAAGAGAGCGTCTGOU^GGAGGAAGAGAAACGTAGGAGAGAAGAAGAGGAAAGGCTTCGACGGGA 
GGAAGAGGAAAGGATACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCAGCAAAAGCAGCAGATAATGGCA 
GCTTTAAACTCCCAGACTGCCGTGCAGTTCCAGCAGTATGCAGCCCAACGGTATCCAGGGAACTACGAAC 
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GCAGGMGCAGATTCCGTGATTACAGTGGCCG 
AAGGATCATATCTCTTTTGGGJ^TTTGCCACAGACA 

^^GACTCTCC^CACT^^ 

GAAAACATCGGTTGTGAAGAGAAAGCCAAAAAGAATGCCAA 

TGTACCGACGGGACTGTCATGAGGAGGTGTATGCTGGCT^CC^ 

CCTCAAGTTTGACAACTCCTACrCTTTGTGGCGGTCAAAATCAGTCT 

TAAAAATGTTGTTACAAAGTCTGC^GTCTAGGGTTGGGCAGAAGATGACATTTAATTTGGAAATTTCTTT 
TTACTTTTGTGGAGCATTAGAGTCACAGTTTACCTTATTGATATTGGTCTGATGGTTTCT - 
TGGGAATCAAAATTTCCTTC^GACTCTTTAGCATTCATACTTTGGGGTTAAAG 

TCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGGGGATGCTGAAGTTACATGAGCTACATGTTAAATA 

TTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTTACCCGGCTGATGGTTAGCCCCTTGCTGCCT 

CTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCTAATTTGAAATCCATAAGTTAACAAGTCTA 

TATCAGGTGCAGCTGGCTTTGATTAAAGGCCATTTTTT^ 

ATAGAAAAAGAAATGGCCTCAGCTTGATCTCGTTCAGAATGACCCAGATTGTTTCTGCCTTGGGTGCAGC 
TGTTTAGTTCAGAGTTATATTACAGAGAATTA^ 

ATTGATAATTGAAGTATCAAGATACGTAGAACACCTCAGAGATTTTTCTTCAGGAACTTCCACAAACTTT 
GAATCCTTGTATCTTTATTTGGTATTCATACTACTAGTAGCAA?^ATACAGGTTTTTTGTTTTGTTTTGTT 
TTGTTTTGGCTTC^TAGAGTATCTCAAATTGAAACT^TTCTGCACAAAGAATAAAATTAAGGATTTTATA 
AACTCG AATTGG CAC CTACTGAATT AAAATACAT AAAATCATTTAAATATAATT CAGC AT ATGGGAAGT A 
ACATTGCACTAATATGGAAATCACTGCCAGAGACAGTCTATTTT 

AACCCCACATTATTCCAGTTTGGAATTACTTATTAAGGAGAATTGGAAATACATGTGCCCATGCTTAAAT 
TTTATAGCTTTAATTTGTGTTATC^ 

-AAATATGGATTAATTGCCTCAAATTTGTATAAAGTGATTGGCTAGTGATTCTTGTTTTCAGAGGGGAGAG 
TGGTATAGATAGAAAATGACGAAGATGGCAATATACACTTAATGTTGTTATTGTATGTTGTTACTGAAGA 
CTTAGATTTTTAAAATTTCAAATCCTAAATCACTTCTTGTAGGGGGGTTTTCATTAACTGCAGTATATAC 
AGTTCACTACATATGGGTTGTTTGAGTTTTTTGTGTGCTGTATTTCTTTCTGTTTTTTAATACCTGGTTT 

TGTACATATCTAACTCTGTTCTCTTTTGGTTGTT 

ATTTGTGTTTTTTAATTTTGATTCAGAAGTAGTCCCAGCTCATAGGCGTTGATACTGTTACATCa^GAAC 

ATTTGTCAGGCTCTCTGTCAGCTTTCATGTACATATGGTATAGAAACCATGGAGT^ 

TTTTTTTTATGAGAAAAATNCTGTATTTAA^^ 

TTTCTTCCAGCCTTTGATTACAAATTTGTCCTTGCACATGTTAAGATGAATTATCTCCTAAAAATATCAT - 
TGTTCTTGGGAGCAGTGTATGTTACTTTAGATAGCAGCGGT^ 

TTTGGTTTTAAACTTTCTTATTGCCTTTGGCTGTTGATTAGTACAGTACAAGTGCGATTTCAAAAAGATC 

TTGAAAGTAATATATTTAATCAATTAAAATGTTTAT^ 

AA 

Human GOCAP1 mRNA sequence - var6 (public gi: 28374435) (SEQ ID NO: 66) 

TCCGTCCCCGCTGAGGAGGTGCAGCAGCGGGAGATGGCGGCGGTGCTGAACGCAGAGCGACTCGAGGTGT 

CCGTCGACGGCCTCACGCTGAGCCCGGACCCGGAGGAGCX3GCCTGGGGCGGAGGGCGCCCCGCTGCTGCC 

GCCACCGCTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGGCGCCTCAGGGGAGCAGCCCGAG 

CCCGGGGAGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGCGCTGGGGTTTCGGCC 

TGGAGGAGTTGTACGGCCTGGCACTGCGCTTCTTCAAAGAAAAAGATGGCAAAGCATTT 

TGAAGAAAAATTGAAGCTTGTGGCACTGCATAAGCAAGTTCTTATGGGCCCATATAATCGA 

CCTGAGGTTGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAGCCCTGGGAAACATGT 

CTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTeTTAAATAGGTGTTGCCATCTCTTTTC^ 

TGCGTCCCACAAAATAGAGAAGGAAGAGCAAGAAAAAAAAAGGAAGGAGGAAGAGGAGCGAAGGCGGCGT 

GAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGGAGAGAAGAAGAGGAAAGGCTTC 

GACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCAGCAAAAGCAGCAGAT 

AATGGCAGCTTTAAACTCCCAGACTGCCGTGCAGTT^ 

TACGAACAGCAGCAAATTCTCATCCGCCAGTTGCAGGAGCAACACTATCAGCAGTAC^ 

ATCAAGTCCAGCTTGCACAGCAACAGGCAGCATTACAGAAAG^ 

CTTGCCTACATCATCAAAAGTGAATGCAACTGTACC^GTAATATGATGTCAGTTAAT 

ACACAC^CTGACAGCTCCGAAAAAGAACTGGAACCGGAAGCTGCAGAAGAAGCCCTGGAGAATGGACCAA 

AAGAATCTCTTCGAGTAATAGCAGCTCCATCCATGTGGAGACGACCTCAG 

GATTCAGCAGGATGCAGATTCCGTGATTACAGTGGGCCGAGGAGAAGTGGTCACTGTTCGAGTACCCACC 
CATGAAGAAGGATCATATCTCTTTTGGGAATTTQCCACAGAC^ 

TTGAATGGACAGACTCTCCAAACACTGCTGTCAGCGXGCATGTCAGTGAGTCCAGCGATGACGACGAGGA 
GGAAGAAGAAAACATCGGTTGTGAAGAGAAAGCCAAAAAGAATGCCAACAAGCCTTTGCTGGATGAGATT 
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TTAAATATTTAAAGTCTCCAAAATAAAACACCCCAACGTTGJ 
CTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTH 
AAGTCTATATCAGGTGCAGCTGGCTTTGATTAAAGGCCATT 
GATTATAATAGAAAAAGAAATGGCCTCAGTTTGATCTCGTT* 



TTAAATATTTAA^x^^'— — ™— TCTCCTCTTATTTGA^TOCATAAGTTAAC 

rrAi 



ZTGCCTGCTCCATGTG1 
&AGTCTATATCAGGTGC 
3ATTATAATAGAAAAAG 
GTGCAGCTGTTTAGTTC 
AAAACTAATTGATAATT 
AAACTTTGAATCCTTG*] 

TTTTGTTTTGGCTTCATAGAGTATCTCAAAT1 
TAAACTCAAATTGGCA( 



CTGCCTGCTCCATGTGTCTTATGAGAGCCCGTA( 

- ^LiUtJU'iVJ 

ATCTCGr 
GAATTA' 
TAGAACi 
CATACTi 
AAACTT 
.CATAAA 
,GACAGT< 
TATTAA 1 
JTTGACG 
'AAGTGA 
S.TATACA 
?CACTTC 
iTTGTGT 

TAATTTGTGTTTTTTAATTTTGATTCAGAAGTjMTCCC^G(^ciTAGG , 3 rTCATACTGTTACATCCAGA 



GTGCAGC 
AAAACTA 
AAACTTT 
TTTTGTT 



aaactttgaatccitotatc^ 

TACATAAAATCATTTAAATATAATTCAGCATATGGGAAG 
lGAGACAGTCTATTTTCTTTTAATTTGTTACTACTTAGTC 

tacagttIScI^ 



JUVCAAATATGGATTAATTGCCTCAAATTTGTATAAGTGATTGGCTA 



ACATTTGTCAGGCTCTCTGTCAGCTCTCATGXACATA 

rG 

XTCATTCTTCTTGGGAGCAGTGTATGTT^ 



GATTTTTTTTTTTATGAGAAAAATACTGTATTTAAAAT 



g^Stgaaagtaatatatttaatcaattaaaatgtttatctgtcaaa^ 

Wntnan GOCAP1 mRNA sequence- var7 (publicgi:25058702)(SEQIDNO:67) 

tccSSgccctcgccacctggatccggtcgcggcccgggcgcctcaggggagcagcc^gcccg^ 
aa^gagaaogt^ 

ggamaggaaaggagacggatagaagaagaaaggcttcggttggagcagc^^ 
gcS?aaactcccagactgccgtgcagttccagcagtatgcagcccaacagtatccaggga^ 

ArCAGCAAATTCTCATCCGCCAGTTGCAGGAGCAACACTATCAGCAGTACA.TGCAGC^ 
rr^CT^CAG^CAG^ 

a S?cat?aa^^ 

rToIctG^T^AAAAAGAACTGGAACCAGAAGCTGCAGAAGAAG 

ScaSaataI^c^^^ 

SatgcIgattccgtgattacagtgggccgaggagaagtggtcactgttc^ 

™SiScTCCAAACA^ 

GA M,AC ^^Sa^gaaagccaaaaagaatgcc^ 
tgtaccgaSSS^^ 



GAAAACA' 



CCTCAAGTTTGACJ^CTCCT 

taaaaatgISgotacaaagtctggagtctagggttgggcag 

^Sttgtg^agca^agagtcacagtt 

tgggaatcaaamttccttga^ctctttagcato 

T^CAG^CT^gS^ 

Sta^gtctccaaaataaaacaccccaacgttgaccttaa^ 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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Human GOCAP1 inRNA sequence - var8 (public gi: 273892s) (SEQ ID NO: 68) 

GAATTCCGTTGCrGTCGGAGCCOSTAGTTACAGTGTCCTCTAATTTGAAATCCATAAGTTA 
TATCAGGTACAGCTGGCTTTCATTAAAGGCCATTTT^ 

AATAGAAAAAGAAATGGCCTCAGTTTGATCTCGTTCAGAATGACCCAGATTGTTTTCTGCTTTGGGTGCA 

GCTGTTTAGTTCAGAGTTATATTAC^GAGAATTATTTTCTGAGAAATCTTAAAACTAGAATGTT 

TAATTCGATAATTGAAGTATCAAGATACGTAGAAC^CCT^ 

TTTAGAATCCTTGTATCTTTATTTGGTATTGATAOTACTAGTCGCAAAATACAGG 

TATAAAACTC7VAATTGG(^CCTACTGAATTAAAATACATAAAATGCA 
AGTAACATTGC^CTAATATGGAAATCACTGC^ 

TC^CAACCCCACATTATTCCAGTTTGGAATTACTTATTAAGGAGAATTGGAAATACATATGCC 
AAATTTTATAGCTTTAATTTGTGTTATTTCT^ 

AAACAAATATGGATTAATTGCCTCAAATTTGTATAAGTGATTGGCTAGTGATTCTTGTTTTCAGAGGGAG 

AGTGGTATAGATAGAAAATGAC&AAGATGGCAATATAra^ 

GTACTTAGATTTTTAAAATTTCAAAT^^ 

ACAGTTCACTACATATGGGTTGTTTGAGTTTTTTGTGT^ 
TTTGTACATATCTAACTCTGTTCTCTTTTGGTTGTTCAGAAACTGG 

TAATTTGTGTTTTTTAATTTTGATTCAGAAGTAGTCCCAGCTCATAGGTGTTCATACTGTTACATCCAGA 
ACATTTGTCAGGCTCTCTGTCAGCTTTCATGTACATATGGTATAGAAACCATGGAGTTAGGCACTTCCTG 
GATTTTTTTTTTATGAGAAAAATACTGTATTTAAAATGTAAAATAAACTTTTAAAAAGC 

Human GOCAP1 Protein sequence - varl (public gi: 24496473) (SEQ ID NO: 240) 

MAAVIiNAERLEVSVDGIiTLSPDPEERPGAEGAPLLPPPIjPPPSPPGSGRGPGASGEQPEPGEAAAGGAAE 

EARRLiEQRWGFGLEELYGIiALRLFKEKrXSKAFHPTYEEKLKLVAI^ 

DRRREWAALGI^SKEDAIWEFVKIjLNRCCHLFSTYVASHKIEKE 

EEKRRREEEERLRREEEERIRIEEERLRLEQQKQQIMAALNSQTAVQFQQYAAQRYPGNYEQQQILIRQL 

PEQHYQQYMMLYQVQIAQQQAALQKQQEVW^^ 

PE^EALENGPKESLPVIAAPSMOT^ 

MDNCDIGFGVYFEWTDSPNT^^ 

VYAGSHQYPGRGVYLLKFDNSYSLWRSKSVYYRVYYTR 

Human GOCAP1 Protein sequence - var2 (public gi: 21951497) (SEQ ID NO: 241) 

RTRGOiLFSTYVASHKIEKEEQEKKRKEEEERRRREEEERERLQKEEEKR^ 

QKQQEVVVAGSSLPTSSKVKATVPSNMMSVNGQAKTHTDSSEKELEPEAAEEALENG 

WTRPQI KDFKEKIQQDADS VI TVGRGEWTVRVPTHEEGS YLFWEFATDNYDI GFGVYFEWTOS PNTAVS 

r^f SSDDDEEEEBNI ^^^ 
WRS KS VYYRVYYTR 

Human GOCAPl Protein sequence - var3 (public gi: 15799259) (SEQ ID NO- 242) 

™^^^ STO ^ TLSPDPEERPGAEGAPLLPPPLI)PPSPPGSGR GPGASGEQPEPGEAAAGGAAE 
^^ E T QRWGFGLBELYG ^ RFF ^^ G ^ F HPTYEEKLIO J VALHKQVI^GPYNPDTCPEVGFFDVI^ 
^^^^^ / ^^^ L ^ CCmPSTW ^ HME ^EQEKKRKEEEERRRREEEERERLQKE 

ZZZ^SS! mQQLYQVQ ^ QQA ^^ E ^AGSSLPTSSKVNATVPSNNMSWGQAKTHTDSSEra 
^^^ ENGP ^ SLPVI ^ S ^ TRPQIroFrara ^ D ^ S VITVGRGEWTVRVPTHEEGSYLFWEF 

!S™H OCAP1 Protein sequence - var4 (public gi: io43806i) (SEQ ID NO: 243) 
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Uniaene Name; G0SR2 Unigene ID; Hs. 432552 

1 mKNA sequence - varl (public gi: 2316087) (SEQ ID NO: 69) 

.rrar^AAArGCACAAGCAGGTCCACGAGATCCAGTCTTGCATGGGACGCCTGGA 




Human GOSR2 mRNA sequence - var2 (public gi: 34 83 524) (SEQ ID NO: 70) 

TTTTTTTTTTCAGGACAGATTGGCCTTTATACTAAATTCCACAATATACCTGGTATTAGTACAGCCTGAA 
TCCGGGGCTGGTCACAGAAGGAAAAAGGTTGTAGTCCCTGAAAACAGAGTGTT^ 

ACAGATGTCTCCACGGTGGGATCTGCCCAC^CTGGCTGGGCAAAATGAGGGCCTGGCTGGCAGGTGCTAA 
TATATTTCAGGGAAG^^ 

AAATCACAAATCTTTGCCTACAGCTACTTTCTAAGGG 

AGCCACAGTCTCTGAACCTTAAAAGCCAACCACTCTATTAACAACTA 

AGACTGAACACTCCCGGGAAATAACACTGGCCTCACTTTAGAAAAGAGAAACACCC^ 

AAATCTTACTTGTTATCGGCAATAGCACTACATCTTGTTTCCCCTAGGTAGCTGCTTCCAGGGAATGGTG 

ACAAGTATTGGC^GTCAGTGATCTACATGTCACTGAGGCACAGGGGAGGGTGGCCAGGAGCACGAGGATG 

^ * n m™, Tv r*™* ~n a mrpT, a aTaTa aTRRr TaTnAGAAAAGGCCTCTTTCCTTTCCTTTCCACTTTTGCTC 



Human GOSR2 mRNA sequence - var3 (public gi: 21961348) (SEQ ID NO: 71) 

GGCCTGCCGGGCCGGCGACATGGATCCCCTGTTCCAGCAAACGCACAAGCAGGTCCACGAGATCCAGT^ 

TGCATGGGACGCCTGGAGACGGCAGACAAGCAGTCTC^ 
TAGACCAGATATTC^GCCGTCTAGAACGT^ 

AAATGCCAGACTTCGGGTTGACCAGTTAAAGTATGATGTCCAGCACCTGCAGACTGCGCTCAGA^ 

CAGCATCGGCGCCATGCAAGGGAGCAGCAGGAGAGACAGCGAGAAGAGCTTCTGTCT 

CTAACGACTCTGACACCACCATACCAATGGACGAATGACTGCAGTTTAAOT 

CAACGGCATGGATGACCTCATTTTAGATGGGCACAATATTTTAGATGGACTGAGGACCCAGAGACTGACC 

TTGAAGGTGGGGTCCCTGCTGGGGGACAGAGAGAAGGCCTCTTGTTTTAGCCTCATCCAACAGTTTAGTA 

ACTGTGTTTATATTTTGATTACK3TGTCCTCAAATTGTGATATTTTGATGACAAGACAGAGCCCTTGAGT^ 

TGGGATCCTTTCTGTTGGAGTTGAGTTATTGTGAGCCTGAAAGTACCCAGTTCCTTTGCCAGTGCTTGAA 

ACAAACCATGAAGTGGCCTCTCTTAGGATCCAGGTCTTTTCCCATTTACTGAACTTA 

TGCTACTACGAGGGGTCCAATCACAGGCTGAGAAATTGTGTTACAGAATCTACTCTTGGAAG 

CGTGGCTGTCCTTTGGTACCTCGCTTTAAGGTGGCTTTCCCTTAGGACCCCTACTGTGGACTGCCTTATA 

ACTAAAACCTTTTGTATTTTAGTAACTGAATCCCCACTGTGCAGTGTTAGGGCTGCCTGGTTGTTTGCAG 

TAGATTAGAGCTTTAGAAGCTTCTAGAGCTTCTAAAGCCCGTGCTGGTGATCCCAGCGACTCTTCACTCC 

CTAGCCTTAGGTATTCCTAGAAGCCCTGACCAGTTGGCACTGCTGAGACTCCAGCCCCTGGGAGTGGTTT 

ACAGAAACATTACACAGACTCTGATGTCAGTCATGATGTTTCAGCCTCTGCCCT^ , 

TGATGGATAATAGGGCGTGGGTTCTGTCTGTTATCAGGGTGTGGTCCCCTGTGAATGAAGCACT 

CACTGAGCTGTGAGAAACAGTCACTCGGAAGTGTGAGCr^ 

CTGTGAGCTCCACAGGACTTCAGTACGTTTCTGAACAGTCCCTGCCATCTCT 

CAAGCTGCAAGTGAGACTC^CCTCCTGCTGACAGTTGCAGTGT^ 

AGCAGCCTGCTGGGCGCTCCCCTTTCATGAGAC^ 

TCTGCCCTTTCTTTAGAAAACTGCTACTCTCCT 

CCAAACTCCATGTCACACTGATGAAGAGCCAGTGGGGGTTAGAGCGTCCTGTTAAGGCACA 
CCOVCTCAAGTCTGGCAGCGCTGGGGCATCAGC^CACOT 
GCTGTGAGGTGGCTGGGGGTTGAGACT^GAGGTTTCTAACTTTCCT 
TTTGTCTCTTGATTCCCTCCACCTGCCTCACACCCTGCCT 

AAGGAACATAGGAGAGGGC^TGAAGGGGCTAGGCTGAAGCACTCTGATGACTGGGG^ 
AAAATGAATACATTTTTTGAAATTTATC^ 

CAGTGATGAAATAATCTGTGTGTTCAGAGCC^GCAGGACTTTAGCAAG 

TCTCGGGGAAAAAAAAATACAAATACATTTCTCTGATCTCTGATGGCAATGAAGTT^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Human GOSR2 mRNA sequence - var4 (P^t£,}^^^S^^c 

cSgcaaggagccc^aacaaaaggcaaa^ 

SSScTCTCTCGAACCTT^^ 

AGATGGAGTGAGGACCaMsAGACTGAC^ 
c"tGGGCTT^^ 

gcaScaaggatcctoccagacagcacactttggaggaaggtctg^ 

CACCAGTGCTTTCCACACTTGACAGTGGTT 

ctctagtttggaataaaaattgcagaggtggaaaaaaaaaaaaaaaaaaa 

Humaa GOSR2 mRNA sequence - var5 (public gi: 12711466) (SEQ^NO: 73) 
agccggagccgtggcctgcggggccggcgacatggatcccctgttccagcaaacgc^ 

TCctlG^GCATAGACCAGATATrCAGCCGTCTAGAACGTCTGGAGATTT^ 

taac^aaaaggcaaaatgccagacttcgggt^ 

CTCAGAAACTTCCAGCATCGGCGCCATGCAAGGGAGCAGCAGGAGAGACAGCGAGAAGAGCTTCTGT 

• CCAGAAAGTCCaWgG^ 

CAGMACTGACCTTGAAGGGGACTCAGAAGAAGATCCTTGACATTGCCAACA^ 

CAOTGATGCGGCTCATCGAGAAGCGGGCTTTCCAGGACAAGTACTTTA 

CCAGACAG(^CACTTTGGAGGAA3GTCTGCAGGGAGCAGCTGAGCCATTTGTTCTTGAACTCTGGGA^ 

^GAAGTCCCCGCACCCATCATGCGTGGACTGATAGGAC^TCTTTTCGTGGTGTGCACCAG^CTTCCAC 

AC^GACAGTGGTTGGCTTTGATGAACCCTCATGCTGCACCTTCAGAGCCAGTCCTCTAGTTTGGAATAA 

AAATTGCAGAGGTGGAAAAAAAAAAAAAAAAAAA 

Human G0SR2 mRNA sequence - var6 (public gi: 37305253) (SEQIDNO: : 74) 

CAATAGAGACAAGGTCTTGCTCTGTCACCCAGGTTGGAGTACAGTGGCATGATCTTGATTCACTACAACC 
TCTACCTCTTGGGTTCAAGCGATCCTCCCACCTCGGTCTTCTGAGTAGCTGGGAATACAGTTATAATOAT 
TCAATATGTTCCCACTGACTGAGGAAAACAAGCATGTGGCCCAGTTGTTGCTCAATACTGGTACTTGTCC 
AAGATGTATCTTCAGATTCTGTGGTGTGGATTTTCATGCACCTTACAAACTTCCATACAAGATGAAGAAA 
CTGAGATACAGAGAGGTTAAGCAACCTCCCAAAGTTCTAGGGTTACAGGTGTTAGCCACTGT^CCT^CC 
TCTAAGGTGATTCTGATGTGTGTATl'TTGGAACCACTGTCTCCTAGACAGAAAGCTTCTGTCTCAAAGAT 
GATCACATTGGTGTAAAGAGCAAAACTTGTTAAGTCCAAAATAAATTCTTACTGTTTATATCCTAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human GOSR2 mRNA sequence - var7 (public gi: 16 905521) (SEQ ID NO: 75) 

GTTCCGAGGAAGCCAGAGCCGGAGCCGTGGCCTGCCGGGCCGGCGACATGGATCCCCTGTTCCAGCAAAC 
GCACAAGCAGGTCCACGAGATCCAGTCTTGCATGGGACGCCTGGAGACGGCAGACAAGCAGTCTGTGCAC 
ATAGTAGAAAACGAAATCCAAGCAAGC^TAGACCAGATATTCAGCCGTCTAGAACGTCTGGAGATTTTGT 
CCAGCAAGGAGCCCCCTAACAAAAGGCAAAATGCCAGACTTCGGGTTGACCAGTTAAAGTATGATGTCCA 
GCACCTGCAGACTGCGCTCAG7VAACTTCCAGCATCGGCGCCATGCAAGGGAGCAGCAGGAGAGACAGCGA 
GAAGAGCTTCTGTCTCGAACCTTCACCACTAACGACTCTGACACCACCATACCAATGGACGAATCACTGC 
AGTTTAACTCCTCCCTCCAGAAAGTTCACAACGGCATGGATGACCTCATTTTAGATGGGCACAATATTTT 
AGATGGACTGAGGACCCAGAGACTGACCTTGAAGGGGACTCAGAAGAAGATCCTTGACATTGCCAACATG 
CTGGGCTTGTCCAACACAGTGATGCGGCTCATCGAGAAGCGGGCTTTCCAGGACAAGTACTTTATGATAG 
GTGGGATGCTGCTGACCTGTGTGGTCATGTTCCTCGTGGTGCAGTACCTGACATGAGCCAGCCACGCTCA 
GTGGCTGAACAGCATTCCCACAGCCTGCAAGTGTGTGTGTGTGTGAAAGAGAGAGGGGGGCCCAGAGGCC 
GCCTTTTGAAATGTTTGCCTGTCTGAACTGTGAAGACACTTGGGAGTGATTGTGGTCTAATTTCCAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Hiunan G0SR2 protein sequence - varl (public gi: 16307241) (SEQ ID NO: 244) 

MDPLFQQTHKQVHEIQSCMGIUiETADKQSVHIVENEIQASIDQIFSRIjERIjEILSSKEPPNKRQNARLRV 

DQLKYDVQHMTAIjRNFQHRRHAREQQERQREELLSRTFTTND^ 

IUXBHNIIiDGLRTQRLTLKGTQKKILDIANMLGL^ 

LT 
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TTnman GOSR2 protein sequence - var2 (public gi: 16905S22) (SEQ ID NO: 245) 

LT 

Human GOSR2 protein sequence - var3 (public gi: 1271X467) (SEQ ID NO: 246) 

GSS 

Human GOSR2 protein sequence - var4 (public gi: 21961349) (^QJ?NO: 247) 

Slfqqthkqvheiqscmgrletadkq^ 

dqlkydvqhlqta™qhrrha^^ 

II^GHNIU3GLRTQRIiTI.KVGSLLGDREKASCFSLIQQFSNCVYIIiITCPQIVIF 

Human GOSR2 protein sequence - var5 (public gi: 23 isoss) (SEQ ID NO: 248) 

^PLFWTHKQVHEIQSCMGRLETADKQSWIVENEIQASIDQIFSRLERLEILSSKEPPN^QN^RV 

DQLKYDVQHiQTALRNFQHRRHAREQQERQREEIjIjCRTFTTNGSDTTIPMDESbQ 

ILDGHNI^GLRTQRLTLKGTQKKIPDIANMLGLSN^ 

LT 

Human GOSR2 pray sequence - varl (SEQ ID NO: 76) , 

AGCGCCGCCATGGNAGTACCCATNCGACGTACCAGATTACGCTCATATGGCCATGGACMCC^GTG^TTC 
^CCCAAGCAGTGGTATCAACGCMAGTGGCCATTATGGCCG^ 

GTTCCGAGGAAGCCAGACCCGGAGCCGTGGCCTGCCGGGCCGGCGACATGGATCCCCTGTTCCAGCAAAC 
GCA(^GCAGGTCCACGAGATCCAGTCTTGCATGGGACGCCTGGAGACGGCAGACAAGCAGTCTGTGCAC 
ATAGTAGAAAACGAAATCCAAGCAAGCATAGACCAGATATTC^GCCGTCTAGGACGTCIGGAGATTTOGT 
CCAGCAAGGAGCCCCCTAACAAAAGGCAAAATGCCAAACTTCGGGTTGACCAGTTAAAGTATGATGTCCA 
GCACCTGCAGACTGCGCTCAGAAACTTCCAGCATCGGCGCNATGCAAGGGAGCAGCGGGAGAGACAGCGA 

GAAGANCTTOTGTCTCNAACCTTAACC^ 
■NGTTAACamiCCTCCCNCNAAAAAGTTACAAACGGCTTGNm 

TNAATNGCNTTGGGNNCeCRAAACCTTCCTTTNGNGGGGGGGNCCOTTTGGGGGGA?^^ 
TTTTTTTTANCCOTnOlHCA^^ 

TTTTNWAAANAAAAACNCCCCCCTNTGGGGGGGCCTTNTTTTGGGGNGGMJNTTTTTGNNCCANNAAAA 
ACCCCMTTTTNTNWGGNNGGARAAAAAAAAAAAGNNNNTNTTTNTA 

Human HERPUD 1 mRNA sequence - varl (public gi: 16507801) (SEQ ID NO: 77) 

AGAGACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCA 
GCGGAGCCCCGACACCX5CCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTG 
AAGAGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCA 
AGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAA 
GCTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGGAAAAACGGCATGTTTTGCATCTGGTG 
TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGAGCCTG 
CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCG 
GAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCATCACGTTGGGTGGTTTCCATTTAGACCGAGG 
CCGGTTC^GAACTTCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTAC 
AGGAAGGCACTGATCCTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGA 
GC^GACCAGCCCCTCCTTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCA 
GAAGGCCCCCCAGCCATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGA 
CTGGATCACCTGACTCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGT 
GGATGATGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAA 

aaatgcccaaggcttctcatgtctttattctgaagagctttaatAtatactctatgtagtttaataagca 

CTGTACGTAGAAGGCCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCAT 
GTGTGTTTGTACATAGAAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATG 
TTTAAATTACACTAAGTGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTT 
CTAGGAAAGACTTATGTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCG 
GAGGTGAAAACCrTTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Human HERPUD1 mRNA sequence - var2 (P^SSH^SSS^S 

GCTGTGTGGCCCAGGCTTTTCTCAAACTCCTGAGGG^GCGATCCTCC^CCT^^^G^ 

TGGGACTAdAGGCATGTGCCACTAGACCTGGCTCTAAAGACATATATGA^ 
CATTTC^CAATGTTTATTCACATATATGGTATTAGTATTCTAATGTAGTGATGCACTCTA^ 

* CTTAAAAGCACTTT ATCATTTCATTTCCCTGCACTGXAATTT^ 

'gaIgg^a^atattgc^tactattttagaaagttgtgggctgggttgtatttata^ 

TCAGATGTCTGCAATGAGTAAATTTA 

AA^TT^AATGAAAGTGTAGCATACTOTAGGGAAAAAATATGAATATTTTAGCATCTATOT 

™gtWtaaatgtcagactatt^ 

TGGGGGGTAGGAGATGTAAGC^ _ 
GCTCCTTTGTTCTGTGTTCATGTTACACTTATTTAGGTGGCTGAATCCACAGAGGAGCCTGC^CTCTA 

ATCGGGGA^AGTATCCTrGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAG 

TTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCT^GTCCTGGT 

SctctcgSac^caccctatc^^ 
IcSgcama^ 

tgt^tctctgcacctgctccagcccctattcacaaccagtttcq^g 

Ltgctgctcctcaagtggttgttaatcctggagccaatcaaaatttgcggatgaatgca^^tggcc 
ctattgtggaagaagatgatgaaataaatcgagattggttgga^ 

tgtttttctcagtatcctctacttctactcctccctgaggagattcctcatggtcatgggggccaccgtt 
gttatgtacctgcatcacgttgggtggtttccatttagaccgaggccggttcagaacttcccaaatgatg 
gtcctcctcctgacgttgtaaatcaggaccccaacaataacttacaggaaggcactgatcctgaaactga 
agaccccaaccacctccctccagacagggatgtactagatggcgagcagaccagcccctcctttatgagc 
acagcatggcitgtottcaagactttctttgcctctcttcttccagaaggccccccagccatcgcaaact 

GATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTGACTC(^GCTAGAT 
TGCCTCTCCTGGACATXMC^TGATGAGTT.TTTAAAAAACAGTGTGGATGATGATATGCTTTTGTGAGCA 

'agcaaaagcagaaacgtgaagccgtgatacaaattggtgaacaaaaaatgcccaaggotctcatgtctt 

TATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCCTTAGGTGTT 
GCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACaTAGAAGTCATAG 
ATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTGTACTACT 
TTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTATAATTGC 
TTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTTGCTGGGTTTT 

CTGTTCAATAAAGTTTTACTATGAATGACAAAAAAAAAAAAAAAAAA 

Human HERPUD1 mRNA sequence - var3 (public gi: 3005722) (SEQ ID NO: 79) 

GGCCACCTCAAGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTT 

ATTCTGGGAAGCTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGGAAAAACGGCATGTTTT 

GCATCTGGTGTGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACA 

GAGGAGCCTGCTGGTTCTAATCGGGiSACAGTATCCTGAGGATTCCTCAAGTGATGGTrrAAGGCAAAGGG 

AAGTTCTTCGGAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATT 

CCAAGGCCTGGGTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGGTTCCAGCAG 

ATATATGCACGACAGTACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCTTTTGTTCCACCAC 

CAAGTGCACAAGAGATACCTGTGGTCTCTGCACCTGCTCCAGCCCOTATTCACAACCAGTTTCCAGCTGA 

AAACCAGCCTGCCAATCAGAATGCTGCTCCTGRAGTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGG 

ATGAATGCACAAGGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCT 

ATTCAGCAGCTACATTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCAT 

GGTCATGGGGGCCACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTT 

CAGAACTTCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTACAGGAAG 

GCACTGATCCTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGAC 

CAGCCCCTCCTTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGC 

CCCCCAGCCATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGAT 

CACCTGACTCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGTGGATGA 

TGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAAAAATGC 

CCAAGGCTTCTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTAC 

GTAGAAGGCCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGT 

TTGTACATAGAAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAA 

TTACACTAAGTGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGA 

AAGACTTATGTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTG 

AAAACCTTTGCTGGGTTrrCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAAAAAAAA 

AAAA 
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Ul1 _ an ttprpt rni mRNA sequence - var4 (public gi: 21619176) (SEQ ID NO: 80) 

TOCTGCTTCTAATCGGGGACAGTATCCTGAGGATTCCTC 

HGG^CCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAG^GG^C^GGCC 
ACGACAGTACTACATGCAATATTTAGCAGCCACTGCTGC^ 

Sg^gSccta^tc^ 

aGGCCACCGTTGTTATGTACCnX3CATCA^ 

======= 

ctcmgtctttattctgaagagctttaatat^ 

gaagSSa^^ 
2to?aSaot£at^^ 

gctgggttttctgttcaataaagttttactatgaatgaccctgaaa^ 

Human HERPUD1 mRNA sequence - var5 (public gi: 14249882) (SEQ ID NO: 81) 

S^CG^GCAGAGA^^ 

Sacaccgc^c^Scca^ggagtccgagaccgaacccg 

aacSgcgccaccgcgac^ 

^gccgcg^taccccgagcgtccgcgtc^ 

ggatcaccaatctctcagggactt^ 

^agScaaaaatgccagaaatcaacgccaaggtgg^ 

r^AaTrRGGGACAGTATCCTGAGGATTCCTC^GTGATGGTTTAAGGCAAAGGGAAGTTCTTCGG^^i 
SSScCGOTTA^^ 

ACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCrrTTTGTT 

rarz^TGCTGCT^CTCAAGTGGTTGTTAATCCTGGAGC 

WCtoStTCTCAGTATCCT^^ 
^GTTATGTACCTGCATCACGTTGGGTGG^ 

ATOCTCCTCcicCTGACGTrGTAAATCAGGACCCC^ 
TGAAGACCCGAACCIACCTCCCTCCAGACAGGGATGTACTA 

I^Sgca^cSgtcttcaagactttctttgcctctcttcttccagaaggccccccagccatcgcaa 

gmtgcctctc^ 

S£aaaT£^^ 
ctotattctgaagagctttaatatatactcta^ 

^TG^ATOTCTATGCTTGAGGAACTTTTCCyVAATGTGTGTGTCTGCATGT 
TAQATGCAQAftGTGGTTCTGCTGGTACGATTTGATTCCIGTTGGAATGTOTAAATT 

ACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGAOT 

JSStaaaa^^ 

tttctgttcaataaagttttactatgaaaaaaaaaaaaaaaaaa 



Human HERFUD1 mRNA sequence - var6 (public gi: i26S2674> ^ (SBQIDN 0.82) 

GAACTGTCGTTGCAGAGATTGCGCXSCGGCTGAGA^^ 

CCGACACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTGAAGAGCCC 
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mCCTCCCCTCGATGGGAAAACATCTCAAGGCCTG^ 

tcgSctccggttacacaccctatgggt 

TACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCTTTTGTTCC^ 

Scctctgg^ctgcacctgctccagcccctattcacaacc^gtttccagctgaaa^ 
JcagISgItg^ 

gScctattotcgaagaagatgatgaaataaatcgagattggttggattggacctatt^ 
ScTOTTreTCTCAGTATCCTcra 

ctgaagaccccaacc^cctccctccagacagggatgtactagatggcgagcagaccagcccctcctotat 

gagcacagcatggcttgtcttc^gactttctttgcctctcttcttcc 

aacttoatggtgtttgtgctgtagctgttggaggctttgacaggaatc 

aIattgcctctcctggac^tggcaatgatgagtttttaaaaaacagtgtggatgatgatatg^ 

AGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTG 

J^at^aagagctttaatatatactctatgtagtttaatargcactgtacgtagaaggccttagg 

TOOTGCATGTCTATGCTTGAGGAACTTTTCCyU^ATGTGTGTGTCTGCATGTGTGTTTGTACATAGAAGTC 
TACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTATAA 

SgS™aaa1?g1£^^ 

ttttctgttcaataaagttttactatgaatgaaaaaaaaaaaaaaaaaaaa 

Human HERPUD1 itiKNA sequence- var7 (public gi: 9 7ii684)(SEQIDNO: 83) 

AGAGACGTGAACTGTCGTTGCA.GAGATTGCGGGCGGCTGAGACGCCGCCTGCCTrGGCACCTAGGAGCGCA 
ScGGAGCCCCGACACCGCCGCCGCCGCCATGGAGT 

AAGAGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCA 
AGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTMTCTG^AA 
GCTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCARAGCAGGAAAAACGGCATGTTTTGCATCTG 
GTGTGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGAGC 
CTGCTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCT 
TCGGAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGC 

CTGQQTCCxwmiTCKxacrrrj^^ 

ACAAGAGATACC^GG^CTGCACCTGCTCCAGCCCCTATTCACAA 

CCTGCCAATCAGAATGCTGCTCCTCAA.GTGGTTGTTAATCCTGGAGCCAATGAAAATTTGCGGATGAATG 
CACAAGGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCTATTCAGC 
AGCTACATTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCATGGTCATG 
GGGGCCACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACT 
TCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTACAGGAAGGCACTGA 
TCCTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGCCCC 
TCCTTTATGAGCAC^GCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGCCCCCCAG 
CCATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTGA 
CTCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGTGGATGATGATATG 
C1TTTGTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAAAAATGCCCAAGGC 
TTCTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAG 
GCCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACA 
TAGAAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACT 
AAGTGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTT 
ATGTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCT 

TTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTG 

Human HERPUD1 mRNA sequence - varS (public gi: 3005718) (SEQ ID NO: 84) 

GACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCG 
GAGCCCCGACACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTGAAG 
AGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGG 
CCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCT 
GTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCATGTTTTGCATCTGGTG 
TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGAGCCTG 
CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCG 
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faapPTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCTG 

^gScctJctcctgacgttgtaaatcaggacccc^ 

TGAAACTGAAGACCCCAACCACCrCCCTCCAGACAGGGATGTACT 

TCGCAAACTOATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTGACTC 
CA^CTA^ATTGCCTCTCCTGGACATGGC^ 

SotScIIgcaIaagcagaaacgtgaagccgtgataca^ 

TCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACT^^ 

^GGTGTTGCATGTCTATGCTTGAGGAACTrrTCCAAATGTGTGTGTCTGCATGTGTGm 

AAGTCATAGATGCAGARGTGGTTCTGCTGGTACGAriTGATTCCTGTTGGAATGTTTAAATTACACTAAG 

TGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTT^ 

tSaattgctc?™ 

CTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAAAAAAAAAAAA 

Human HERPUD1 mRNA sequence - var9 (public gi: 2S596o) (SEQ ID NO: 85) 

CGTGAACGGTCGTTGCAGAGAT^C<^GCGGCTGAGACGCCCK:CTGCCTGGCACCTA(MAGCGCAGCGGA 

CCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGGCC 

^TTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCATGTTTTGCATCT^GTGTG 
*CAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGA 
GGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCGGA 
ACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCTGGG 
TCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGGTTCCAGCAGATATATGCACGA 
CAGTACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCTTTTGTTCCACCACCAAGTGCACAAG 
AGATACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAACCAGCCTGC 
CAATCAGAATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGGATGAATGCACAA 
GGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCTATTCAGCAGCTA 
CATTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCATGGTCATGGGGGC 
CACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACTTCCCA 
AATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTACAGGAAGGCACTGATCCTG 
AAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGCCCCTCCTT 
TATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGCCCCCCAGCCATC 
GCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTGACTCCA 
GCTAGATTGCCTCTCCTGGACZATGGCAATGATGAGTTTTTAAAAAACAGTGTGGATGATGATATGCTTTT 
GTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAAAAATGCCCAAGGCTTCTC 
ATGTGTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGC^CTGTACGTAGAAGGCCTT 
AGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACATAGAA 
GTCATAGATGCAGAAGTGGTTCTGCTGGTAAGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTG 
TACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTA 
TAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTTGCT 

GGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTG 

Human HERPUD1 mRNA sequence - var 10 (public gi: 76si869) (SEQ ID NO: 86) 

GACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCG 
GAGCCCCGACACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTGAAG 
AGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGG 
CCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCT 
GTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCATGTTTTGCATCTGGTG 
TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGAGCCTG 
CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCG 
GAACCTTTCITCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCTG 
GGTCCTGGTTTCTCCXSGTTACaCACCCTATGGraTGGCTTCAGCrTTCCTGGTTCCAGCAGATATATGCAC 
GACAGTACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCTTTTGTTCCACCACCAAGTGCACA 
AGAGATACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAACCAGCCT 
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GCCACCGTTGTTATGTACCTGCATCACGTTC 
CAAATGAT^TCCTCCTCCTGACGTTGTAAATCAGGACC^ 
TGAAACTGAAGACCCG^CCACCTCCCT 
TTTATGAGCACAGCATGGCTTGTCTTCAAGA^^ 

TCGCAAACTGATGGTGTTTGTGCTGTAGCTGTXGGAGGCTTTGACAGGAATGGACTGGATCACCTGACTC 
^GCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAAC^GTC 

Sgt^gcaagcaaaagcag^ 

TCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCC 
TTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACATAG 
AAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAG 

TGTACTACTTTATATAATCAATGAAAT^^ 

TATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTTG 
CTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAAAAAAAAAAAA 

Human HERPUD1 Protein sequence - varl (public gi: 16507802) (SEQ ID NO: 249) 

MESETEPEPVTLLVKSPNQRHRDIjELSGDRGWSVGHIjKAHLSRVYPERPRPEDQRLIYSGKLLLDHQCLR 
DLLPKEKRHVLHLVCWKSPSKMPEINAKVAESTEEPAGSNRGQYPEDSSSDGIiRQREVLRNLSSPGWEN 
I SRHHVGWFP FRPRPVQNFPNDGPPPDWNQDPNNNLQEGTDPETEDPNHIiPPDRDVIiDGEQTSPSFMST 
AWIiVFKTFFASLLPEGPPAIAN 

Human HERPUD1 Protein sequence - var2 (public gi: i044i9ii) (SEQ ID NO: 250) 

MQ YIiAATAASGAFVPPP S AQE I P WSAPAPAP IHNQF PAENQ PANQNAAPQVWNPGANQNIjRMNAQGGP 
IVEEDDEINRDWLDWTYSAATFSVFLSIIjYFYSSLSRFIjMVMGATVVMYLHHVGWFPFRPRPVQNFPNDG 
.PPPDWNQDPNI^LQEGTDPETEDPNHLPPDRDVLDGEQTSPSFMSTAWLVFKTFFASLLPEGPPAIAN 




Unigene Name : HLA-A Unigene ID : Hs . 181244 Clone ID : GD_159 
Human HLA-A mRNA sequence - varl (public gi: 575248) (SEQ ID NO: 87) 

ATGGCCGTCATGGCGCCCCGAACCCTCGTCCTGCTACTCTCGGGGGCTCTGGCCCTGACCCAGACCTGGG 
CGGGCTCTC^CTCCATGAGGTATTTCTT^CATCCGTGTCCCGGCCCGGCCGCGGGGAGCCCCGCTTC^T 
CGCAGTGGGCTACGTGGACGACACGCAGTTCGTGCGGTTCGACAGCGACGCCGCGAGCCAGAGGATGGAG 
CCGCGGGCGCCGTGGATAGAGCAGGAGGGTCCGGAGTATTGGGACGGGGAGACACGGAAAGTGAAGGCCC 

ACTCACAGACTCACCGAGTGGACCTGGGGACCCTGCGC^ 

GACCGTCCAGAGGATGTATGGCTGCGACGTGGGGTCGGACTGGCGCTTCCTCCGCGGGTACCACCAGTAC 
GCCTACGACGGCAAGGATTAC^TCGCCCTGAAAGAGGACCTGCGCTCTTGGACCGCGGCGGACATGGCAG 
CTCAGACCACC^^GCACAAGTGGGAGGCGGCCCATGTGGCGGAGCAGTTGAGAGCCTACCTGGAGGGCGA 
GTGCGTGGAGTGGCTCCGCAGATACCTGGAGAACGGGAAGGAGACGCTGCAGCGCACGGACGCCCCCAAA 
ACGCATATGACTCACCACGCTGTCTCTGACCATGAAGCCACCCTGAGGTGCTGGGCCCTGAGCTTCTACC 
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OTGCGGAGATCACACTGACCTGGCAGCGGGATGGGGAGGACCAGACCCAGGACACGGAGCTCGTGGAGAC 

SggcctccaggSatg^ 

SScSg^^ca^tgagggtttgcccaagcccctcaccctgagatgggagccgtot 
SScatcctcatcgtgggcatca^ 

cgctgctgtgatgtggaggaggaagagctcagatagaaaaggagggagctactctcaggctgcaagcagt 
gacagtgcccagggctctgatgtgtctctcacagcttgtaaagtgtga 



Human HLA-A mKNA sequence - var2 (public gi: 187857) (SEQ ID NO: 88) 

atggSgotggcgccccg^ 

Sggctcccactccatgaggtat^ 

Sccgtgggctacgt^ 

ccgcgggcgccgtggatagag<^ggac^ggccggagtattgggaccggaacacacg^ 

agt^cagactgaccgagtggacctggggaccctgcgcggctactacaaccagagcgaggccggttctca 

caccatccagatgatgtatggctgcgacgtggggtcggacgggcgcttcctccgcgggtaccggcaggac 

gcctacgacggcaacsgattac^tcgccctgaaagaggacctgcgctcttggaccgcggcggacatg^ 

ctcagaccaccaagcacaagtgggaggcggcccatgtggcc^agcagtggagagcctacctggagggcac 

gtgcgtggagtggctccgcagatacctggagaacgggaaggagacgctgcagcgcacggacgcccccaaa 

acgcatatgactcaccacgctgtctctgaccatgaagccaccctgaggtgctgggccctgagcttctacc 

ctgcggagatcacactgacctggcagcgggatggggaggaccagaccc^ggacacggagctcgtggagac 

caggcctgcaggggatggaaccttccagaagtgggtggctgtggtggtgccttctggacaggagcagaga 

tacacctgccatgtgcagcatgagggtttgcccaagcccctcaccctgagatgggagccgtcttcccagc 

CCACCATCCCCATCGTGGGCATCATTGCTGGCCTGGTTCTCTTTGGAGCTGTGATCACTGGAGCTGTGGT 

cgctgctgtgatgtggaggaggaagagctcagatagaaaaggagggagctactctcaggctgcaagcagt 

GACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGTAAAGTGTGA 

Human HLA-A protein sequence - varl (public gi: 575249) (SEQ ID NO: 253) 

MAVMAPRTIiVLIiIiSGALAIjTQTWAGSHSMRYFFTSVSRPGRGEPRPIAVGYVDDTQFVRFDSDAASQRME 
PRAPWIEQEGPEYVroGETRKVKAHSQTHRVDLGTLRGYYNQSEJVGSHTVQRMYGCDVGSDWRFLRGYHQy 
AYDGKDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGECVEWLRRYI.EWGKETLQRTDAPK 
THMTHHAVSDHEATI.RCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGQEQR 
YTCHVQHEGLPKPLTLRWEPSSQPTIPIVGIIAGLVLFGAVITGAWAAVMWRRKSSDRKGGSYSQAASS 

DSAQGSDVSLTACKV 
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Uniqene Name : HLA-B Unigene ID : Hs. 77961 Clone ID: 3GD_1122 
HuiBaii' HLA-B mKNA sequence - vart (public gi: 3 2 is 8) (SEQ ID NO: 89) 

SSctggStacgtggacgacacgcagttcgtgaggttcgacagcgac^^ 
^ccgcgggScgtggatagagcaggaggggccggagtattgggaccgggagacaca^ 

aSScIgactta^^ 
cacIStccaga^^ 

SScgacggcaaggattacatcgccctgaacgaggacctgagct 

CTCAGATCACCCAG^CAAGTCGGAGGCGGCCCGTGAGGCGGAGCAGC 

Sgcgtggagtggctccgcaga^ 

ACACATG^CCCACCACCCCATCTCTGACCATGAGGCCACCCTGAGGTGCTGGGCCCTC^ 

ctccggagatcacactgacctggca^ 

CAGACCAGCAGGAGATAGAACCTTCCAGAAGTGGGCAGCTGTGGTGGTGCCTTCTGGAGAAGAGCAGAGA 

TACACATGCCATGTACAGCATGAGGGGCTGCCGAAGCCCCTCACCCTGAGATGGGAGCCATCTTCC 

CCACCATCCCCATCGTGGGCATTGTTGCTGGCCTGGCTGTCCTAGCAGTTGTGGTCATCGGAGCTGTGGT 

GACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGA 

Human HLA-B protein sequence- varl (public gi: 32189) (SEQ ID NO: 254) 

^IVTAPRTVIjIiLLSGAIj7^TETWAGSHSMRYFYTAMSRPGRGEPRFISVGYVDDTQFVI^DSDAASPREE 

PRAPWIEQEGPEYWDRETQISKTNTQTYRESLRNIjRGYYNQSEAGSHTLQRMyGCDVGPDGRIjIiR 

AYDGKDYIALNEDLSSWTAADTAAQITQRKWEAAREAEQLRAYIiEGIiCV^ 

•THVTHHPISDHEATLRCWAIK3FYPAEITLTWQRDGEDQTQDTELVETRPAGDRTFQ^AA 

YTCHVQHEGLPKPLTLRWEPSSQSTIPIVGIVAGLAVLAVWIGAWATVMCRRKSSGGKGGSYSQAASS 

DSAQGSDVSLTA 

paiqene Name -. MSTP028 Unigene ID : Hs. 302746 Clone ID: GD_1119 
Human MSTP028 mRNA sequence - varl (public gi: 14042294) (SEQ ID NO: 90) 

CCCCGCCTCCGCCCCCGGCTGGCGTGAGCTGGGTGTTTCCTGCCTCTCTCAGTCCGGGTTTGGAGACTCC 

TGCGTCCTCCGACTTTTCGTGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGCGGCTG 

CTACCCGCACCACTTCCTTCAAGGGCACGAGCCCCAGCTCCAAATACGTGAAGCTGAATGTGGGTGGAGC 

CCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCCATGTTCAGCGGGCGC 

ATGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTCATTGACCGCTGTGGGAAGCACTTTGGTACGATAC 

TCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCCGCCGGGAGATCGAGGAGCTGCTAGCAGA 

AGCCAAGTACTACCTAGTCCAAGGCCTGGTGGAAGAGTGCCAGGCGGCCCTACAAAACAAAGATACTTAT 

GAGCCTTTCTGCAAGGTCCCTGTGATCACCTCATCCAAGGAAGAACAAAAA.CTTATAGCGACTTCAAATA 

AGCCAGCCGTGAAGTTGCTCTACAAC^AAGTAACAACAAATACTCATATACCAGCAATTCTGACGACAA 

TATGTTGAAAAACATTGAACTGTTTGATAAGCTGTCTCTGCGCTTTAACGGAAGGGTCCTGTTCATAAAG 

GATGTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGGCCGGAAGATTGCTGAAGTCTGTT 

GTACCTCCATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCGGATTTATGA 

GGAGACCCTGAACATTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCTGACAATGCGCTCCTGGAGGCC 

ACAGGCGGGGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGAGCGGGAGCGGATCGAGCGCGTGC 

GGAGGATCCACATCAAGCGCCCTGATGACCGGGCCCACCTCCACCAGTGAGCAGGCAAGAGACCGAGCCG 

CCCTCCTCTCACCGCCCCCACTCCCTGCCGTGCTACACCCAGATCCTGTGCAGGCTGCCGGGCCCCTTCT 

GCTTCCCTTGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTATTTTGGTATTGCTTGACAAGGCA 

AATTGATTGTCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTGTCTAAGATCTCTACTTTTC 

ATGAGAATCTGAGACTCTTTGGAGCCAGGCTTTCTCGGTTCTCAGAGGAAAAGTATGAATGAGTGTGAAG 

TGTATGTGAGAACTTTTGTTTGCAATATTTATTTTTGTGGGTGTCGGCTTCCTATGTGGGCTTTTTGGGT 

GACACTCCCTTAAGGGTTCAGTTTGACAATTCTGAGAGTTGTCCTGCAGTTGGAGGCCACCAGAGGTATC 

TGAGCTCCCTGCTTCCTATTTCaTAATCCTCCAGCCCCAGCA^MTC<^CTCCTGGTTCCTGTGTGTTTGG 

CCCGGGCACAATCCCCACTGCTTTGCTAGACGTGCTTTCTGCCATGTGGCTTTGGGCCTAGAGCTTGTTG 

ATAATTGCAGCTTGTGGCAGGGGAAATATGGCTGAATGAGCGTCTAAATCGTTGAGACCAGTGCAACTTT 

GGGTGCAAGGCTTTGTTTAGGGATCAAGCCTTTTGCCACCTTGGGCTGGTCTTTGGCCTGGTGCTCACTG 

GGACCCCATATGTCTGCGTAGGAGC^GAACTTTCCATGGCAGTAAGTGTCCAGCTCTGTTTCTGGTTCTT 

TCCCCAACTCCAGCCCCGTCC^GTTGTTCTCCTGATTGACCCGACTCCACTCCAGGAAGGCCATCTGACC 
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TTGTCWTQCCTC2TQA 

ATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATGTTCG 
- GGACGTGTGCTTGTGGGGCACAGGTGGCGGCCCCTGCTGGAGCCCGGCTGGGCGCAGCGCCTATGTAGGA 

CGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCTGCACCTGCAAAGGAACAGGAGTGAGTCGTGACTG 

ACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATGTGAATGTGCGTCAGGTGATGGAT 

GGGTTTGTCAAGGGAATCGTTACCGTTTTATACCAAAGGTATTAACATGGGC^GCCTTTGACACATGTAT 

TCCAAAAACGAGTTTATATTTTCAAACGGTTTTTACAGCT^ 
CAGTTGTATGCCTTCATTTTGTATCCAACAGCA^ 

Human MSTP028 mRNA sequence - var2 (public gi: 13994352) (SEQ ID NO: 91) 

GGAGACTCCTGCGTCCTCCGACTTTTCATGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGC 
CAGCGGCTGOTACCCGCACGACTTCCTTCAAGGGCACGAGCCCC^ 

GGGTGGAGCCCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCCATGTTC 
AGCGGGCGCATGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTGATTGA 

GTACGATACTCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCCGCCGGGAGATCGAGGAGCT 
GCTAGCAGAAGCCAAGTACTACCTAGTCCAAGGCCTGGTGGAAGAGTGCCAGGCGGCCCTACAAAACAAA 
GATACTTATGAGCCTTTCTGCAAGGTCCCTGTGATCACCTCATCCAAGGAAGAACAAAAACTTATAGCGA 
CTTCAAATAAGCCAGCCX5TGAAGTTGCTCTACAACAGAAGTAACAACAAATACTCATATACCAGGAATTC 
TGACGACAATATGTTGAAAAACATTGAACTGTTTGATAAGCTGTCTCTGCGCTTTAACGGAAGGGTCCTG 
TTCATAAAGGATGTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGGCCGGAAGATTGCTG 
AAGTCTGTTGTACCTCCATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCG 
GATTTATGAGGAGACCCTGAACATTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCTGACAATGCGCTC 
CTGGAGGCCACAGGCGGGGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGAGCGGGAGCGGATCG 
AGCGCGTGCGGAGGATCCACATCAAGCGCCCTGATGACCGGGCCCACCOTCACCAGTGAGGAGGCAAGAG 
ACCGAGCCGGCCTCCTCTCACCGCCCCCACTCCCTGCCGTGCTA(^CCCAGATCCTGTGCAGGCTGCCGG 
GCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTATTTTGGTATTGCTT 
GACAAGGCAAATTGATTGTCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTGTCTAAGATCT 

CTACTTTTCATGAGAATCTGAGACTCTTTGGAGC^ 

AGTGTGAAGTGTATGTGAGAACTITTGTTTGCAATATTTATTTTTGTGGGTGTCGACTTCCTATGTGGGC 
TTTTTGGGTGACACTCCCTTAAGGGTTCAGTO 

AGAGGTATCTGAGCTCCCTGCTTCCTATTTCATAATCCTCCAGCCCCAGCAGGTCCACTCCTGGTTCCTG 
TGTGTTTGGCCCGGGCACAATCCCCACTGCTTTGCTAGACGTGCTTTCTGCCATGTGGCTTTGGGCCTAG 
AGCTTGTTGATAATTGCAGCTTGTGGCAGTGGAAATATGGCTGAATGAGCGTCTAAATCGTTGAGACCAG 
TGCAACTTTGGGTGCAAGGCTTTGTTTAGGGATCAAGCCTTTTGCCACCTTGGGCTGGTCTTTGGCCTGG 
TGCTCACTGGGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGTGTCCAGCTCTGTTT 
CTGGTTCTTTCCCCAACTCCAGCCCCGTCCAGTTGTTCTCCTGATTGACCCGACTCCACTCCAGGAAGGC 
CATCTGACCCTGTGACAGGCATAGCTCATAAACTACCCCTCCCTGGGATCCCGCTCCTCTTCAGCCTCCT 
TCCCCATGAAGCTGGGCTAACTTTCTAAGTCATTTTGCTTAGAAATTCAGTGTGGCCCATACCCTTTGTC 
CTCCC^GCCTGGCATCCAGGCAGGGACACCCTCACACCACCAGCCCCAGGGAGCTTCCCTGCTATAAACA 
C^GACCCCCTTGTCTTTGCCTCTGATTTTTACAC^GTGTAGAGTGGCCAGCAGTGAAC^GGTTGAGGAT^ 
TGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATGTTTGTTTAAGGGATATGTGTGAC 
TGTGGGTGGGGACX3TGTGCTTG1K^GGCACAGGTGGCGGCCCCTGCTGGAGCCCGGCTGGGCGCAGCGCC 
TATGTAGGACGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCTGCACCTGCAAAGGAACAGGAGTGAG 
TCGTGACTGACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATGTGAATGTGCGTCAG 
GTGATGGATGGGTTTGTCAAGGGAATCGTTACCGTTTTATACCAAAGGTATTAACATGGGCAGCCTTTGA 

CAGATGTATTCCAAAAACGAGTTTATATTTTCAAACGGTT^ 

TGCCTGTGACAGTTGTATGCCTTCATTTTGTATCCAACAGCAAAGTCTACAATAAAACTTTAAAACAATC 
ATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human MSTP028 tnRNA sequence - var3 (public gi: 25303941) (SEQ ID NO: 92) 

CCGGGTTTGG AGACTCCTGCX3TCCTCCGACTTTTCATGGAAGAGATGTCA.GG AGAAAGTGTGGTGAGCT C • 

AGCGGTGCCAGCGGCTGCTACCCGCACCACTTCCTTCAAGGGCACGAGCCCCAGCTCCAAATACGTGA^ 

CTGAATGTGGGTGGAGCCCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGG 

CCATGTTCAGCGGGCGCATGGAAGTGCTCACCGACAGTGAAGGCTGG^ 
GCACTTTGGTACGATACTCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCTC 
GAGGAGCTGCTAGCAGAAGCCAAGTACTACCTAGTCCAAGGCCTGGTGG71AGAGTGCCAGGCGGCCCTAC 
AAAACAAAGATACTTATGAGCCmTCTGCAAGGTCCCTGTGATCACCTCATCCAAGGAAGAACAAAAACT 

TATAGCGACTTCAAATAAGCC^GCCGTGAAGTTGCTCTACAACAGAAGTJ^ 
AGCAATTCTGACGACAATATGTTGAAAAACATTGAACTGTTTGATAAGCTGTCT 
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rr^TCCTGTTCATAAAGGATGTCATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGGCCGGAA 

a^S?Sggaggccacaggcggggcggcggggcgctcccaccacctgg 

gIcaaSg^Sagccgccctcctctcaccgcccc^^ 

tattccttga^aag^ 

TAAGATCTCTACTTTTCATGAGAATCTGAGACT 

SaStgagtgt^^ 

ATGTGGGCTTTTTGGGTGACACTCCCTTAAGGGT^ 
AGGCCACCAGAGGTATCTGAGCTCCCTGCTTC^ 

GG^CTAGAGCTGTTGA^ 

GAGACCAGTGCAACTTTGGGTGCAAGGCTTTGTTTAGGGATCAAGCCT 
TGGCCTGGTGGTCACTGGGACCCCATATGTCTGC^ 

agSaJggccatSLccctgtgacaggcatagctcataaactacc 
tataaacacagacccccttgtctttgcctctga^ 

tgag^tgtgcgggtagatagataactttgggtctggtttgtgtctgtgttcatgtt 

TGTGACTGTGGGTGGGGACGTGTGCrrGTGGGGCACAGGTGGCCCCTGCTGGAGCCCGGCTGGGCGCAGC 

GCCTATGTAGGACGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCTGCACCTGCAAAGGAACAGGAOT 

GAGTCGTGACTGACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATGTGAATGTGCGT 

CAGGTGATGGATGGGTTTGTCAAGGGAATCGTTACCGTTTTATACCAAAGGTATTAACATGGGCAGCCTT 

TGACACATGTATTCCAAAAACGAGTTTATATTTTCyiAACGGTTTTTACAGCTTAGACTTTGTACTTAC^ 

CCCTGCCTGTGACAGTOGTATGCCTTCATTTTGTATCCyACAGCAAAGTCTACU^TAAAACTTTAAAACA 

ATCATGACTGAATGTCAAAATCGTGTATTGGGCAGATGCTTTTTAAACTGTCGTGTGAGAAACTTTTATA 

XTAGGCCATTTGGATTTTATTAAGTGCTAAGGAAAGAGGGCTTACAAAATGTTTCGTAAATATTTTATAC 

TGTTTAAGTGTTAAACACCAACCCTGTCTTTCTTTTGGGTTGAGCTTTTTO 



TGGAGTGTAGTGGC^CCATCTCCACTTACCACAACrTGTGCCTCCTGGGTTCAAGCGATTCTGCTGCCTC 

AGCCTCCCGAGTAGCTGGGATTGCAGGTACCCATCAGCCCATGCCCAGCTAATTTTGTATTTTTAGTAGA 

GATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCCTGTGATCCGACCACCTTGGCCTC 

CCAAAGTGCTGGGATTACAGGTGTGAGTCACCACACCTGGCTGCATAGTGTTTTAAATGTTTGTGTGAAG 

AATGAGTTTGTGGAACAATTTGATTTGCTGTGGCCTCTATGCCTAATGAGCTAGTGTTTCTGGCAGCTCT 

CTCTACCCAACTTTGCACTTGTAGTTTTGAGTCTTTGTCTCTCTGGAATATGAACAGGTTTATAAAACAT 

TCCATGGTGAACAATTCTGTCGGCTGCATTATAGCCATGAGTGAATAGACAGCATTGGCTGGTCCAAGCT 

CTGTTATTGAGTATACAAGGAAOTGATTTTTCTTATGTTAGCACTAAGGGCAAAAACCAATATTTATAAT 

GTAAGCACTATCCAGGTAAAACACTGGCCC7VAGATTTGGTAAAGAGATTTCATTGCAATGTAATAACTAC 

AGTTTTTTACAAATTGGAACAGCTTTGGTGTGTCGTAATCAAGGGTTTTTTTTGTTTGTTTGTTTCAAAT 

AAGCCATCTGATTGTGGTGACTGGGGCCCATGTCCAAGACAATTCCTGGCATATTCTGTCACCCTCCCGT 

GGGGCGATCACTGTGTGGGGACCCCATTCCCCAGTTAAAGTGTGTCTCTGTACCTTACAACAGCGATTCA 

GGACCCAAGTGTGAACAACACTCAGCCCGCCCTCTGGAGCGTGTGCTCTCTTTAGGGCTCTACCCAAAGT 

CACTGTAAC^GTTAAGTGTGTCATTAACCTTTCTGTCTCTTTGCGCCATAAAAAAATGCTCAAAGTTTTA 

GATGTAGCCACTGTATGTTGTACAAACGTTGGCGACATGTAAAATAAAAGTCATAAAATGCAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



Human MSTP028 mRNA sequence - var4 (public gi: 16552440) (SEQ ID NO: 93) 

AGTCCGGGTTTGGAGACTCCTGCGTCCTCCGACTTTTCATGGGCCCTGACATGGCAGGTGATATCCAGGA 
CACTGTTGGGTGCCATGGAGTTGGGGAGAGTTGGCCAGAAGAGTTGGATAACCTTGAATTGAATATTGTC 
CATGACTGTCGCGTTGCTTCTGCTGTTGCAAGCTGCCTCCCTTTACTCCCAGTCCCATTTACAAAAATAA 
CGCTTGTGTTrTACCAGTTATAGTTGTAGTACCCATTCATTATAGAAAATCTGGAAAAGCTAGACAATTC 
TTTTTCAGTTTCAGGGAATAGTTCAAACAAGTTATGTGCTGTCAGTGCCCTGCAGCCAAAAAGCACGAGG 
AGCATACCTGTAGTCAAGCAAAGTTGGGTTTATTTCCTTGTTGCATTGGGGTGGGGAAGAACTGTGGGAC 
ATCTC^GAGAAGGGOTGTCGGCTTGTGTTGGGTGATTTGAGAGACAGTTCAGAGAAGTGGGGCTTTGCTC 
TGTGTTGGATGCTGCTGGGAAGCAGGGCTAATTCTGTGATTGGGTCTCAGTGATTCCTGACTTGAAAGCA 
GGAAGAATGGAAGGAGGCTAAACTTCTCATTGGTAAAGCAGCAGCTGTAACTCCTATTAGCCAGGATAGG 
GGATCTTTGGTCATTTTTGTATTTTGGATAATGTTTATGTTTTTGTCTGTGTCCGGACATGATGACTGAA 
TGGTCCTGTTTTTGTCTTGTCGCAAGGGCACAGAGTGGCCTTGTCTGAGGGTGATGTGCTGTGAAAAACT 
GTTGATGTTCAATGGGAATGGTAGGGCCAGCCGTGGGGGCTACCCCAGATTCAGCAAAGATTCTGCCCAC 
CCTTG(^CATTTCCACCTCTACAGTTTTACCTGTTCATTTCAGACATGTTTGCTCTGAGTACACATGTGC 
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CGGATACCAATCTCACTTTdcAGGCCTGCGTAAATCAGCCACT 

AGTCAGCCATGCTATCAGGGAGATGGTAGTGGGATCTTGCTCTTTTGGGCAGCACTAGTCTAGGAGGTCT 

AATTTTGCAATAACTTGGTTCCAAAAGTTTCCATC 
CAGGAAGTGATAGGAGTGCGAGCTG 

TGAAAAATGACACTTAGCTAGCTATTGAGTGGTACATGGCAATAAGGAAATGTAAAGAGACCTGGGCAGT 

^ GCTTTAGGCTGTTTTAGGGTGCAGCCAGGGTGTT 

• XAACACAAGAGTTAGGGGCACCCTTGTGCCTGCAGGGTC^ 
'^GGTGGGTCTTGGGACAAACTAGGGGATGCATGGCCCTCTCTAGGGGTCATCCAATACCCCAGCTCTGACC 

AGTTGTTCCCCTGCTAGCCCAGTTGGCCTCTGATTTTAGGAGAAGCCAGAAGTCCAGATTTTTCTGTGAG 

CTCTCCTTAGTTGACCACATTGGAAGCAAACTTTTAAATGCTGTGTATGro 

CTGGAGGCCAGATTGAATCCACAGGCTGAAAGCAGTCAACCAGG 

CeACTGGCAGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGCGGCTGCTACCCGCACC 

ACTTCCTTCAAGGGCACGAGCCCt^GCTCCAAATACG 
CCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCG^^ 

CACCGACAGTGAAGGCTGGATCCTCATTGACCGCTGTGGGAAGCACTTTGGTACGATACTCAACTACCTT 
CGAGACGGGGCGGTGCCTTTACCCGAGAGCCGCCGGGAGATCGAGGAGCTGCTAGCAGAAGCCAAGTACT 
ACCTAGTCCAAGGCCTGGTGGAAGAGTGCCAGGCGGCCCTACAACAGAACAAAGATACTTATGAGCCTT^ 

CTGCAAGGTCCCTGTGATC^CCT^ 

GTGAAGTTGCTCTACAACAGAAGTAACAACAAATACTCATATACCAGCAATTCTGACGACAATATGTTGA 
AAAACATTGAACTGTTTGATAAGCTGTCTCTC^GCTTTAACGGAAGGGTCCTGTTCATAAAGGATGTCAT 
TGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTGAGGGCCGGAAGATTGCTGAAGTCTGTTGTACCTCC 
ATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCGGATTTATGAGGAGACCC 
TGAACATTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCTGACAATGCGCTCCTGGAGGCCACAGGCGG 
GGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGAGCGGGAGCGGATCGAGCGCGTGCGGAGGATC 
CACATCAAGCGCCCTGATGACCGGGCCCACCTCCACCAGTGAGCAGGCAAGAGACCGAGCCGCCCTCCTC 
TCACCGCCCCCACTCCCTGCCGTGCTACACCC^GATCCTGTGCAGGCTGCCGGGCCCCTTCTGCTTCCCT 
TGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTATTTTGGTATTGCTTGACAAGGCAAATTGATT 
<5TCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTGTCTAAGATCTCTACTTTTCATGAGAAT 
CTGAGACTCTTTGGAGCC^GGCTTTCTCGGTTC 

Human MSTP028 mRNA sequence - var5 (publicgi:2i750697)(SEQIDNO:94) 

GCTGGCGTGAGCTGGGTGTTTCCTGCCTCTCTCAGTCCGGGTTTGGAGACTCCTGCGTCCTCCGACTTTT 

CATGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGC^ 

TTCAAGGGGkCGAGCCCCAGCTCCAAATACGTGAAGCTXSAATO 

TGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCCATGTCCAGCGGGCGCATGGAAGTGCTCACCGA 
C^GTGAAGAACAAAGATACTTATGAGCCTTTCTGCAAGGTCCCTGTGATCACCTCATCCAAGGAAGAACA 
AAAACTTATAGCGACTTGAAkTAAGCCAGCCGTGAAGTTGCTCTACT^ 
TATACCAGCGATTCTGACGACAATATGTTGAAAAACAT^^ 

ACGGAAGGGTCCTGTTCATAAAGGATGTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGG 
CCGGAAGATTGCTGAAGTCTGTTGTACCTCCATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAG 
TTTCCCGAAGCCCGGATTTATGAGGAGACCCTGAACATTTrGCTGTATGAGGCCCAGGGTGGCCGGGGAC 
CTGACAATGCGCTCCTGGAGGCGACAGGCGGGGCGGCGGGG^^ 

GCGGGAGCGGATCGAGCGCGTGCGGAGGATCCACATCAAGCGCCCTGATGACCGGGCCCACCTCCACCAG 

TGAGCAGGCAAGAGACCGAGCCGCCCTCCTCTC^CCGCCCCCACTCCCTGCCGTGCTACACCCAGATCCT 

GTGCAGGCTGCCGGGCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTA 

TTTTGGTATTGCTTGACAAGGCAAATTGATOGTCTTGAC^ 

TGTGTCTAAGATCTCTACTTTTCATGAGAATCT^ 

GAAAAGTATGAATGAGTGTGAAGTGTATGTGAGAACTTTTGTTTC 

CTTCCTGTGTGGGCTTTTTGGGTGACACTCCCTTAAGGGTTCAGTTTGACAATTCTGAGAGTTGTCCTGC 

AGtTGGAGGCC^CCAGAGGTATCTGAGCTCCCTGCTTCCrAraTCATAATCCTCCAGCCCCAGCAGGTCC 

ACTCCTGGTTCCTGTGTGTTTGGCCO^GCACAATCCC 

GGCTTTGGGCCTAGAGCTTGTTGATAATTGCAGCTTGTGGC^GTC 

ATCGTTGAGACCAGTGCAACTTTGGGTGCAAGGCTTTGT^ 

GGTCTTTGGCCTGGTGCTCACTGGGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGT 

GTCCAGCTCTGTTTCTGGTTCTTTCCCCAACTCC^GCCCCGTCCAGTTGTTCTCCTGATTGACCCGACTC 

CACTCCAGGAAGGCCATCTGACCCTGTGACAGGCATAGCTCATAAACTACCCCTCCCTGGGATCCCGCTC 

CTCTTCAGCCTCCTTCCCCATGAAGCTGGGCrAACTTTCTAAGTCATTTTGCT^ 

CCATACCCTTTGTCCTCCCAGCCTGGCATCC^ 

CCCTGCTATAAAC^CAGACCCCCTTGTCTTTGCCTCTGAT^ 

AGAGGTTGAGGATGTGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATGTTTGTTTAA 
GGGATATGTGTGACTGTGGGTGGGGACGTGTGCTTGTGGGGCAC^GGTGGCGGCCCCTGCTGGAGCCTGG 
CTGGGCGCAGCGCCTATGTAGGACGGGTGTTCTCAGTGACCTACCTCCC^GGCTCCTCTGCACCTGCTVAA 
GGAAC^GGAGTGAGTCGTGACTGACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATG 



Figure 36 part - 51 



p c T/ U S O «+/ O 6 3 O 8 

TGAATGTGCGTCAGGTGATGGATGGGTTTGTCAAGGGAATCGTT^ 
ACTTTAAAACAATCATG 




Human MSTP028 Protein sequence - var2 (public gi: 14042295) (SEQ ID NO: 256) 

TS SKEEQKL I ATSNKPAVKLLYNRSNNKYS YTSNSDDNMLKNI ELFDKLS LiRFNGRVLFI KDVIGDE I CC 
WS FYGQGRKI AEVCCTS I VYATEKKQTKVEFPE7UII YEETLNI LLYE AQDGRGPDNAIiLEATGGAAGRSH 
HLDEDEERERIERVRRIHIKRPDDRAHLHQ 

Unigene Name: PACS-1 Uniqene ID; Hs. 58589 

Human PACS-1 mRNA sequence- varl (public gi: 27781345) (SEQ ID NO: 95) 

AGCACGAGTCTGGTTGTGCCGGAGAAAGTCAAAACTCCCATGAAGTCCAGTAAAACGGATCTC^GGGCT 
CTGCCTCCCCCAGCAAAGTGGAGGGGGTGCACACACCCCGGCAGAAGAGGAGCACGCCCCTGAAGGAGC^ 

GGAGCTCTCCAAGCCCCTAAGTGAGAGGACCAACAGTTC^ 
AGCACGCAGATTCGAAGAAAGGTGGTGTATGACCAGCTGAATCAGAT 

CAGAAAATGTCATTCTGGTGAACACCACTGACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCC 
GCGGAAGCCTGTGGTGTGCACCTGCTCCACCGTGGAGGTCCAGGCCGTGCTGTCCGCCCTGCTCACCCGG 
ATCCAGCGCTACTGCAACTGCAACTCTTCCATGCCGAGGCCAGTGAAGGTGGCTGCTGTGGG^^CCAGA 
GCTACCTGAGCTCCATCCTCAGGTTCTTTGTCAAGTCCCTGGCCAACAAGACCTCCGACTGGCTTO^ 

CATGCGCTTCCTCATC^TCCCCCTCGGTTCT^ 
TACAGTAGTTCCTTCCTGGATTCTGGTTGGAGAGATCTGTT 

AACTGGACGTGGCAGGGCGGGTGATGCAGTACGTCAACGGGGCAGCCACGACACACCAGCTTCCCGTGGC 
CGAAGCCATGCTGACTTGCCGGCATAAGTTCCCTGATGAAGACTCCTATCAGAAGTTTATTCCCTTCATT 
GGCGTGGTGAAGGTGGGTCTGGTTGAAGACTCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGG 
TCAGCCTTACTGTGCCCTCCACATCACCACCCTCCAGCTCGGGCCTGAGCCGAGACGCCACGGCCACCCC 
TCCCTCCTCCCCATCTATGAGGAGCGCCCTGGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTG 
ATTGGCCTCCAGGTGGACTACTGGCTGGGCCACCCCGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACG 
CCAGCTCGAAGAACACCCTCAAGAGTGTCTTCCGCTCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGA 
GGCCCAGCTTTCTGGCACCATGGCCATGACTGTGGTCACCAAAGAAAAGAACAAGAAAGTTCCCACCATC 
TTCCTGAGCAAGAAACCCCGAGAAAAGGAGGTGGATTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCC 
TCATCTGCTCAGCCAAGCAGCAGCAGACTATGCTGAGAGTGTCCATCGATGGGGTCGAGTGGAGTGACAT 

CAAGTTCTTCCAGCTGGC^^ 

AGCAAGGCCACOTGAGGCCCTGTCTCCCAGCCACTTTCCC 

CGGGCAGGGGGAGGCCAGC^GGCCCGGGCCCAGCACCCCTTCCCTGGCACCAGGGTCTGCCTCTCACTCG 

CCCAGGTCCC^AAGGACACTGCCACAGGGACGCCTTCCCrCCCCTCCCCTCCAGCCCACCCCn^ 

CCCTCCTCCTTCCCGCTTTTCCCCTTCTCCCTCCTC 

GTGCCCATAGTCCCCTGGACTGAGTCCCCCAGGCCTTCCTTCACCCGACTTCCAAACTCTTCCTTGTGGT 
ATCAGTTTCCTTCTCGGAAATGAGAAAGCTGGAATCCTGGTCCCC^GCAGGAGAGCCTAGTCCTCCCCCA 

GCCCCTCCAGCCACCAGGGTGTCCTCTAGGA^ 
GACCCAAGGCCACTTTCAACTCTTATGGGGTT^ 

GGCACCCTGAGCCCACAGGACCCCTACTTCACAGCTCACAGGGGCAGGAGGCAGCTCCCCTGCCTCCAGG 

ACCCTGTTGCTATGGTGAGACAGCGTTTCTAGGACAGAGGGGCCTCCCAGTCTCCCCCCACCACCCGTGC 

ACGACTTCCTC^CCACCCCCAGGTTCCCTGCAGATGTCGTGTGTGTCCTGAGTGTTTCTTTGGTTCTTTG 

CACGCCAAGTCTCTTGGTTGTACCATGTGACACACCCTGTGCACTGGTCGCTGTCTTCGTGGCTTCCACC 

CTTGTTAATGATGCTCCTGCCTCTGCCTCCCAGCCCCTCACCCAGCACAGCTCTGCCTGGACTTGG 

ATGGGAGGCAGACCCCCACCACCATACATGCTGTCTGTGGCCCCT 

GATCTCCCTCCTCCCACCGTGTCAGTTTTTCTGCC^ 

AGCCTGGGCCCAGACCC^TCCrCCCAGCCAGGTTTCCCTCCAGCAGGCTCCTTCCCTC 

CTCTCACC^CCCGGGGTCTGAGCCCCT^TTCCTGACCGTCCGTGTTCTCAGGAGTGGTTGAGGACACA 

GGGCCCCAGCCGAGCCCTCTGCACCCCCCAGCCCGGCC^TerGCGCCCCACAGCCCCTTTGGAGCTTTTC 

TCTTGTCCTCTCACTCCTTCCCAGAAGTTTT^ 
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CATACCTCCCCCAAGCTCTCCTCCAGCCCTTCCCAGGGCTCA 
CTTAAAAAAAAAAAAAAAAAA 

Human PACS-1 roKNA sequence - var2 (public gi: 309 62 845) (SEQ ID NO: 96) 
gcgccgggggcggcagcggccagcggggatccggg 

GCA^^GCAGCCGCCGCA^ 

GCTCCGCGCCTCCCGGTGGCCCGGGGCCAGGCCGCACCCCCGCCCCGGTGCAGATGAACCTGTACGCCAC 

ctgggaggtggaccggagctcgtccagctgcgtgcctaggctattcagcttgaccot^ 

ATGCTAAAAGAAATGGACAAAGATCTTAACTCAGTGGTCATCGCTGTGAAGCTGCAGGGT 

TTCTTCGCTCCAACGAGATCGTCCTTCCAGCTAGTGGACTGGTGGAAACA 

CCTTCAGTACCCTCATTTCCTTAAGCGAGATGCCAACAAGCT 

TACAAGAATCGGACCATCTTGGGCTATAAGACCTTGGCCGTGGGACTCATCAACATGGCAGAGGTGATGC 
AGCATCCTAATGAAGGCGCACTGGTGCTTGGCCTACACAGCAACGTGAAGGATGTCTCTGTGCCTGTGGC 

AGAAATAAAGATCTACTCCCTGTCCAGCC^ 

CGTTCTCCTGATATTGACAATTATTCTGAGGAAGAGGAAGAGAGTTTCTCATCAGAACAGGAAGGCAGTG 
ATGATCCATTGCATGGGCAGGACTTGTTCTACGAAGACGAAGATCTCCGGAAAGTGAAGAAGACCCGGAG 
GAAACTAACCTCAACCTCTGCCATCACAAGGCAACCTAACATCAAACAGAAGTTTGTGGCCCTCCTGAAG 
CGGTTTAAAGTTTCAGATGAGGTGGGCTTTGGGCTGGAGCATGTGTCCCGCGAGCAGATCCGGGAAGTGG 
AAGAGGACTTGGATGAATTGTATG ACAGTCTGGAGATGTACAAC CC CAG ^GACAGTGGCCCTGAG^TGGA 
GGAGACAGAAAGCATCCTCAGCACGCCAAAGCCCAAGCTCAAGCCTTTCTTTGAG^ 

AGCTCCCAGACGGAGATTGGCAGCCTGAACAGCAAAGGCAGCCTCGGAAAAGACACCACCAGCCCTATGG 

AATTGGCTGCTCTAGAAAAAATTAAATCTACTTGGATTAAAAACC^ 

CACTCTGGAAATC^CTGACCAGGACATGTTTGGAGATGCCAGCACGAGTCTGGTTGTGC 

AAAACTCCCATGAAGTCCAGTAAAACGGATCTCCAGGGCTCTGCCTCCCCCAGCAAAGTGGAGGGGGTGC 

'acac^ccccggcagaagaggagcacgcccctgaaggagcggcagctctccaagcccctaagtgagaggac 

CAACAGTTCCGACAGCGAGCGCTCCCCAGATCTGGGCCACAGGACGCAGATTCGAAGAAAGGTGGTGTAT 
GACCAGCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCCCAGAAAAT 

ACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTCCAC 
CGTGGAGGTCGAGGCCGTGCTGTCCGCCCTGCTC^CCCGGAT 

ATGCCGAGGCCAGTGAAGGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCCATCCTCAGGTTCTTTG 
TCAAGTCCCTGGCCAACAAGACCTCCGACTGGCTTGGCTACATGCGCTTCCTCATCATCCCCCTCGGTTC 
TCACCCTGTGGCCAAATACTTGGGGTCAGTCGACAGTAAATACAGTAG 

AGAGATCTGTTCAGTCGCTCGGAGCCACC^GTGTCAGAGCAACTGGACGTGGCAGGGCGGGTGATGCAGT 

ACGTCAACGGGGCAGCGACGACACACCAGCTTCCCGTGGCCGAAGCCATGCTGACTTGCCGGCATAAGTT 

CCCTGATGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGAC 

TCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCTTACTGTGCCCTCCACATCACCAC 

CCTCCAGCTCGGGCCTGAGCCGAGACGCCACGGCC^CCCCTCCCTCCTCCCCATCTATGAGCAGCGCCCT 

GGCCATCX3TGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGGC 

CACCCCGGGGAGCGGAGGAGGGAAGGCGAC^AGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTCT 

TCCGCTC^GTGCAGGTGTCCCGCCTGCCCC^TAGTGGGGAGGCCC^GCTTTCTGGC^CCATGGCCATGAC 

TGTGGTCACCAAAGAAAAGAACAAGAAAGTTCCCACCATCTTCCTGAGCAAGAAACC 

GTGGATTCTAAGAGCCAGGTCATTGAAGGCATCA.GCCGCCTCATCTGCTCAGCCA^ 

TGCTGAGAGTGTCCATCGATGGGGTCGAGTGGAGTGACATCAAGTTCTTCCAGCTGGCAGCCCAGTGGCC 
CACCCATGTCAAGCACTTTCCAGTGGGACTCTTCAGTGGCAGCAAGGCCACCTGAGGCCCTGTCTCCCAG 
CCACTTTCCCTCCTGGCACTGCCACCAGCCTCACCGCCTGCGGGCAGGGGGAGGCCAGCAGGCCCGGGCC 
CJVGCACCCCTTCCCTGGCACCAGGGTCTGCCTCTCACTCGCCCAGG 

CGCCTTCCCTCCCCTCCCCTCCAGCCCACCCCTGCACAGCCCCTCCTCCTTCCCGCTTTTCCCCTTCTCC 

CTCCTGCTCCAGGCCCAAGGCGTGTTGGTTTTGCCTTCTGGTGCCGATAGTCCCCTGGACTGAGTCCCCC 

AGGCCTTCCTTCACCCGACTTCCAAACTCTTCCTTGTGGTATGAGTTTCCTTCTCGGAAATG 

GGAATCCTGGTCCCCAGCAGGAGAGCCTAGTGCTCCCCCAGCCCCTCCAGCCACCAGGGTGTCCTCTAG 

ATGCAGCTGCCAGATCCACTCACTCTGCTGCCTCCAGCAGGACCCAAGGCCACTTTCAACTCTTATGGGG 

TTCTCCACCTGCCCCAGAGCTTCTCAAGGGAGGGTAAGGGGGCACCCTGAGCCC^ 

ACAGCTCACAGGGGCAGGAGGCAGCTCCCCTGCCTCCAGGACCCTGTTGCTATGGTGACACAGCGTTTCT 
. AGGACAGAGGGGCCTCCCAGTCTCCCCCC^CCACCCGTGCACGACTTCCTC^CCACCCCC^ 

CAGATGTCGTGTGTGTCCTGAGTGTTTCTTTGGTTCTTTGCACGCCAAGTCTCTTGGTTGTACCATGTGA 
CACACCCTGTGCACTGGTCGCTGTCTTCGTGGCTTCCACCCTTGTTAATGATGCTCCTGCCTCTGCCTCC 

GAGCCCCTGACCCAGCAOM3CT 

CTGTCTGTGGCCCCTCAGACATTCTGTTTCATCTCCCATTCATCTCCCTCCTCCCACCGTGTCAGTTTTT 
CTGCCTTTCCCTGCTCTGTTCTTCCCCCTCCTTAGGCCCCAGCCTGGGCCCAGACCCATCCTCCCAGCCA 
GGTTTCCCTCCAGCAGGCTCCTTCCCTCCCTGTCACCTCCCTCTCACCAACCCGGGGTCTGAGCCCpTCA 
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TTCCTGACCGTCCGTGTTCTCAGGAGTGGTTGAGGACACAGGGCCCCAGCCCAGCC^TGCACCCCCCA 
AAAAAAAAAAAAAAAA 

Human PACS-1 mRNA sequence - var3 (pubbc gi: 33243994) (SEQ^NO:97) 

CAGAAAGCATCCTCAGCACGCCAAAGCCCAAGCTC^^ 

ccagj^ggagattggcagcctcaa^ 

GCTGCTCTAGAAAAAATTAAATCTACTTGGATTAAAAACCAAGA 

SStCACTGACCAGGACATGTTTGGAGATGCCAG^^ _ 

TCCCATGAAGTCCAGTAAAACGGATCTCCAGGGCTCTGCCTCCCCCAGCAAAGTGGAGGG 

CCCCGGCAGAAGAGGAGCACGCCCCTGAAGGAGCGGCAGCTCTCCAAGCCCCTAAGTCAGAGGACCAA 

GTTCCGACAGCGAGCGCTCCCCAGATCTGGGCCACAGCACGCAGATTCCAAGAAAGGTGGTCT 

GCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCCCAGAAAATGTCATTCTGGTGAACACCACTGA^ 

CAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACOIGCGG^ 

GAGGCCAGTGAAGGTGGCTGCTGTGGGAGGCC^GAGCTACCTGAGCTCCATCCTCAGGTTCTTTGTCAAG 
TCCCTGGCCAACAAGACCTCCGACTGGCTTGGCTACATGCGCTTCCTCATCATCCCCCTCGGTTCTCACC 
CTGTGGCCAAATACTTGGGGTCAGTCGACAGTAAATACAGTAGTTCCTTCCTGGATTCTGGTTGGAGAGA 
TCTGTTCAGTCGCTCGGAGCCACCAGTGTCAGAGCAACTGGACGTGGCAGGGCGGGTGATGCAGTACGTC 
AACGGGGGAGCCACGACACACCAGCTTCCCGTGGCCGAAGCCATGCTGACTTGCCGGCATAAGTTCCCTG 
ATGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGACTCTCC 
CTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCTTACTGTGCCCTCCACATCACCACCCTCC 
AGCTCGGGCCTGAGCCGAGACGCCACGGCCACCCCTCCCTCCTCCCCATCTATGAGGAGCGCCCTGGCCA 
TCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGGCCACCC 
'cGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTCTTCCG^ 
TCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTTTCTGGCACCATGGCCATGACTGTGG 
TCACCAAAGAAAAGAACAAGAAAGTTCCCACCATCTTCCTGAGCAAGAAACCCCGAGAAAAGGAGGTGGA 
TTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCCTCATCTGCTCAGCCAAGCAGCAGCAGACTATGCTG 
AGAGTGTCCATCGATGGGGTCGAGTGGAGTGACATCAAGTTCTTCGAGCTGGCAGCCCAGTGGCCCACCC 
ATGTCAAGCACTTTCCAGTGGGACTCTTCAGTGGCAGCAAGGCCACCTGAGGCCCTGTCTCCCAGCCACT 
TTCCCTCCTGGCACTGCCACCAGCCTCACCGCCTGCGGGCAGGGGGAGGCCAGCAGGCCCGGGCCCAGCA 
CCCCTTCCCTGGCACCAGGGTCTGCCTCTCACTCGCCCAGGTCCCGAAGGACACTGCCACAGGGACGCCT 
TCCCTCCCCTCCCCTCCAGCCCACCCCTGCACAGCCCCTCCTCCXTCCCGCTTTTCCCCTTCTCCCTCCT 
GCTCCAGGCCCAAGGCGTGTTGGTTTTGCCTTCTGGTGCCCATAGTCCCCTGGACTGAGTCCCCCAGGCC 
TTCCTTCACCCGACTTCCAAACTCTTCCTTGTGGTATCAGTTTCCTTCTTGGAAATGAGAAAGCTGGAAT 
CCTGGTCCCCAGCAGGAGAGCCTAGTCCTCCCCCAGCCCCTCCAGCCACCAGGGTGTCCTCTAGGATGCA 
GCTGCCAGATCCACTCACTCTGCTGCCTCCAGCAGGACCCAAGGCCACTTTCAACTCTTATGGGGTTCTC 
CACCTGCCCCAGAGCTTCCCAAGGGAGGGTAAGGGGGCACCCTGAGCCCACAGGACCCCTACTTCACAGC 
TCACAGGGGCAGGAGGCAGCTCCCCTGCCTCCAGGACCCTGTTGCTATGGTGACACAGCGTTTCTAGGAC 
AGAGGGGCCTCCCAGTCTCCCCCCACCACCCGTGCACGACTTCCTCACCACCCCCAGGTTCCCTGCAGAT 
GTCGTGTGTGTCCTGAGTGTTTCTTTGGTTCTTTGCACGCCAAGTCTCTTGGTTGTACCATGTGACACAC 
CCTGTGCACTGGTCGCTGTCTTCGTGGCTTCCACCCTTGTTAATGATGCTCCTGCCTCTGCCTCCCAGCC 
CCTCACCCAGCACAGCTCTGCCTGGACTTGGAGAGATGGGAGGCAGACCCCCACCACCATACATGCTGTC 
TGTGGCCCCTCAGACATTCTGTTTCATCTCCCATTCATCTCCCTCCTCCCACCGTGTCAGTTTTTCTGCC 
TTTCCCTGCTCTGTTCTTCCCCCTCCTTAGGCCCCAGCCTGGGCCCAGACCCATCCTCCCAGCCAGGTTT 
• CCCTCCAGCAGGCTCCTTCCCTCCCTGTCACCTCCCTCTCACCAACCCGGGGTCTGAGCCCCTCATTCCT 
GACCGTCCGTGTTCTCAGGAGTGGTTGAGGACACAGGGCCCCAGCCCAGCCCTCTGCACCCCCCAGCCCG 
GCCATCTGCGCCCCACAGCCCCTTTGGAGCTTTTCTCTTGTCCTCTCACTCCTTCCCAGAAGTTTTTGCA 
CAGAACTTCATTTTGAAAGTGTTTTTCTCATTCTCTATACCTCCCCCAAGCTCTCCTCCAGCCCTTCCCA 
GGGCTCAGCCCTGCTGTCCTGAGCGTCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACTGAGTT 
GATGTTGGGTTTTTCATTCAATAAATTGGTGATTTCTTACCGACAAAAAAAAAAAAAAA 

Human PACS-1 raRNA sequence - var4 (public gi: 34420884) (SEQ ID NO: 98) 

CGCCGCCGCCGCCGCGGGGGAAGCCTGGGAGCCAGATCGGCGTCGCCTCGGCCTCCGTAACCCCCGCCTA 
GCCGGGCCATGGCGGAACGCGGAGGGGCGGGCGGTGGTCCCGGAGGCGCCGGGGGCGGCAGCGGCCAGCG 
GGGATCCGGGGTCGCCCAGTCCCCTCAGCAGCCGATGGCGGAACGCGGAGGGGCGGGCGGTGGTCCCGGA 
GGCGCCGGGGGCGGCAGCGGCCAGCGGGGATCCGGGGTCGCCCAGTCCCCTCAGCAGCCGCCGCCGCAGC 
AGCAGCAGCAGCAGCCGCCGCAGCAGCCGACGCCCCCCAAGCTGGCCCAGGCCACCTCGTCGTCCTCGTC 
CACCTCGGCGGCGGCTGCCTCCTCCTCGTCCTCGTCTACCTCCACCTCCATGGCCGTGGCGGTGGCCTCG 
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mtctocgctccaacgagatcgtc^ 
cccttcagtaccctcatttccttaagcgagatgcc^ 

<^g^cctaatgaaggcgcac^^^ 

cagaaataaagatctactccctgtccagccaacccattgaccatgaaggaatcaaatccaagct 
tcgttctcctgatattgacaattattctgaggaagaggaagagagtttctcatcagaacaggaaggcagt 

GATGATCCATTGCATGGGCAGGACTTGTTCTACGAAGACGAAGATCTCCGGAAAGTGAAGAAGACCCGGA 
GGAAACTAACCTCAACCTCTGCCATCACA^^ 

GCGGTTTAAAGTTTCAGATGAGGTGGGCTTTGGGCTGGAGCATGTGTCCCGCGAGCAGATCCGGGAAGTG 
GAAGAGGACTTGGATGAATTGTATGACAGTCTGGAGATGT ACAACCC CAGCG ACAGTGGC CCTGAGATGG 
AGGAGACAGAAAGCATCCTCAGCACGCCAAAGCCCAAGCT 
CAGCTCCGAGACGGAGATTGGCAGCCTCAAGkGCAAAGG^ 

GAATTGGCTGCTCTAGAAAAAATTAAATCTACTTGGATTAAAAACCAAGATGACAGGTTGACTGAAACAG 
ACACTCTGGAAATCACTGACCAGGACATGTTTGGAGATGCCAGCACGAGTCTGGTTGTGCCGGAGAAAGT 

C^AAACTCCCATGAAGTCC^GTAAAACGGATC^ 

(^CACACCCCGGCAGAAGAGGAGCACGCCCCTGAAGGAGCGGGAGCTCTCCAAGCCCCTAAGTGAGAGGA 

CCAACAGTTCCGACAGCGAGCGCTCCCCAGATCTGGGCCACAGCACGCAGATTCCAAGAAAGGTGGTGTA 

TGACCAGCTC^TCAGATCCTGGTGTCAGATGCAGCCOTCCCAGAAAATGTCATTCTGGTGAACACCAC 

GACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTCCA 

CCGTGGAGGTCCAGGCCGTGCTGTCCGCCCTGCTCACCCGGATCCAGCGCTACTGCAACTGCAACTCTTC 

CATGCCGAGGCCAGTGAAGGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCCATCCTCAGGTTCTTT 

GTCAAGTCCCTGGCCAACAAGACCTCCGACTGGCTTGGCTACATGCGCTTCCTCATCATCCCCCTCGGTT 

CTCACCCTGTGGCCAAATATTTGGGGTCAGTCGACAGTAAATACAGTAGTTCCTTCCTGGATTCTGGTTG 

GAGAGATCTGTTCAGTCGCTCGGAGCCACCAGTGTCAGAGCAACTGGACGTGGCAGGGCGGGTGATGCAG 

TACGTCAACGGGGCAGCCACGAGACACCAGCTTC 

TCCCTGATGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGA 

CTCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCTTACTGTGCCCTCCACATCACCA 

CCCTCCAGCTCGGGCCTGAGCCGAGACGCCACGGCCACCCCTCCCTCCTCCCCATCTATGAACAGCGCCC 

TGGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGG 

CCACCCCGGGGAGCGGAGGAGGGAAGGCGAGAAGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTC 

TTCCGCTCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTTTCTGGC^CCATGGCC 

CTGTGGTCACCAAAGAACTGAACAAGAAAGTTCCCAC^ 

GGTGGATTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCCTCATCTGCTCAGCCAAGCAGCAGCAGACT 
ATGCTGAGAGTGTCCATCGATGGGGTCGAGTGGAGTGACATCAAGTTCTTCCAGCTGGCAGCCCAGTGGC 
CCACCCATGTCAAGCACTTTCCAGTGGGACTCTTCAGTGGCAGCAAGGCCACCTAG 

Human PACS-1 xriRNA sequence - var5 (public gi: 6330230) (SEQ ID NO: 99) 

CTGCCATCACAAGGC»AGCT^ 

TGAGGTGGGCTTTGGGCTGGAGCATGTGTCCCGCGAGCAGATCCGGGAAGTGGAAGAGGACTTGGATGAA 

TTGTATGACAGTCTGGAGATGTACAACCCCAGCGACAGTGGCCCTGAGATGGAGGAGACAGAAAGCATCC 

TCAGCACGCC?U^GCCCAAGCT(^GCCTTTCTTTGAGGGGATGTCGCAGTCCAGCTCCCAGACGGAGAT 

TGGCAGCCTCAACAGCAAAGGCAGCCTCGGAAAAGACACC^CCAGCCCTATGGAATTGGCTGCTCTAG^ 

AAAATTAAATCTACTTGGATTAAAAACCAAGATGACAGCTTGACTGAAACAGACACT 

ACCAGGACATGTTTGGAGATGCCAGCACGAGTCTGGTTGTGCCGGAGAAAGTCAAAACTCCCATGAAGTC 

CAGTAAAACGGATCTCCAGGGCTCTGCCTCCCCCM 

AGGAGCACGCCCCTGAAGGAGCGGC^GCTCTC(^GCCCCTAAGTGAGAGGACC^CAGTTCCGACAGCG 

AGCGCTCCCGAGATCTGGGCCACAGCACGCAGATTCCAAGAAAGGTGGTGT^ 

CCTGGTGTCAGATGCAGCCCTCCCAGAAAATGTCATTCTGGTG^^ 

GTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTC 

TGCTGTCCGCCCTGCTCACCCGGATCCAGCGCTACTGCAA 

GGTGGCTGCTGTGGGAGGCCAGAGOTACCTGAGCTCCATCCTCAGGTTCTTTGTCAAGTCCOT 

AAGACCTCCGACTGGCTTGGCTACATGCGCTTCCTCATCATCCCCCTCGGTTCTCACCCTGTGGCCAAAT 

ACTTGGGGTCAGTCGACAGTAAATACAGTAGTTCCTTCCTGGATTCTGGTTGGAGAGATCTGTTCAGTCG 

CTCGGAGCCACCAGTGTCAGAGCAACTGGACGTGGCAGGGCGGGTGATGCAGTACX5TCAACGGGGCAGCC 

ACGAC^CACCAGCTTCCCGTGGCCGAAGCCATGCTGACTTGCCGGCATAAGTTCCCTGATGAAGACTCCT 

ATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGACTCTCCCTCCA(^GCA^ 

CGATGGGGACGATTCTCCTGTGGTCAGCCTTACTGTG 

AGCCGAGACGCCACXSGCCACCCCTCCCTCCTCCCCATCTATGAGCAGCGCCCTGGCCATCGTGGGGAGC^ 
CTAATAGCCCATATCK3GGAOGTGATTGGCCTCCAGGTC 

GAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTCTTCCGCTCAGTGCAGGTG 
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TCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTTTCTGGCACCATGGCCATGACTGTGGTCACCAAAGAAA 

S£Stg^ 
ttcSgt^ 

actgccaccagcctcaccgcctgcgggcagggggaggcc^gcaggcccgggcccagcacccot^ 
caccagggtctgcctctcactcgccc^ggtcccgaaggacactgccacagggacgccttccctcccctcc 

^CCMCCCACCCCTGCACAGCCCCTCCTCCTTCCCGCTTTTCCC^ 

aggcgtgttggttttgccttct^ 

acttccaaactcttccttgtggtatcagtttccttctcggaa^ 

caggagagcctagtcctcccccagcccctc^^ 

actcactctgctgcctccagc^ggacccaaggccactttcaactcttatggggttctcc^c 
agcttcccaagggagggtaaggxxsgcaccct^ 

gaggcagctcccctgcctccaggaccctgttgctatggtgacac^gcgtttctaggacagaggggc^^ 

CAGTCTCCCCCCACCACCCGTG^CGACra 

CTGAGTGTTTCTTTGOT 

TCGCTGTCTTCGTGGCTTCCACCCra 

CAGCTCTGCCTGGACTTGGAGAGATGGGAGGCAGACCCCCACCACCATACATGCTGTCTGTGGCCCCTCA 
GACATTCTGTTTCATCTCCCATTC^TCTCCCTCCTCCCACCGTGTCAGTTTTTCTGCCTTTCCCTGCTCT 
GTTCTTCCCCCTCCTTAGGCCCC^GCCTGGGCCC^GACCCATCCTCCCTVGCCAGGTTTCCCTCCAGCAGG 
CTCCTTCCCTCCCTGTCACCTCCCTCTCACCAACCCGGGGTOTGAGCCCCTCATTCCTGACCGTCCGTGT 
TCTCAGGAGTGGTTGAGGACACAGGGCCCCAGCCCAGCCCTCTGCACCCCCCAGCCCGGCCATCTGCGCC 
CCACAGCCCCTTTGGAGCTTTTCTCTTGTCCTCTCACTCCTTCCCAGAAGTTTTTGCACAGAACTTCATT 
TTGAAAGTGTTTTTCTCATTCTCCATACCTCCCCCAAGCTCTCCTCCAGCCCTTCCCAGGGCTCAGCCCT 

• qcxGTCCTGAGCGTCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACTGAGTTGATGTTGGGTTT 

TTCATTCAATAAATTGGTGATTTCTTACCG 

• Human PACS-1 mRNA sequence - var6 (public gi: 7022110) (SEQ ID NO: 100) 

CCCTAAGTGAGAGGACCAACAGTTCCGACAGCGAGCGCTCCCCAGATCTGGGCCACAGCACGCAGATTCC 
AAGAAAGGTGGTGTATGACCAGCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCCCAGAAAATGTCATT 
CTGGTGAACACCACTGACTGGCAGGGCGAGTATGTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGG 

TGTGCACCTGCTCCACCGTGGAGGTCCAGGCCGTGC^^ 

CAACTGCAACTCTTCCATGCCGAGGCCAGTGAAGGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCC 

ATCCTCAGGTTCTTTGTC^GTCCCTGGCCAACATGACCTCCGACTGGCTTGGCTACATGCGCT 

TCATCCCCCTCGGTTCTCACCCTGTGGCGAAATACTTGGGG 

CCTGGATTCTGGTTGGAGAGATCTGTTCAGTCGCTCGGAGCC^tCAGTGTCAGAGCAACTGGACGTGGCA 
GGGCGGGTGATGCAGTACGTG^CGGGGCAGCCACGACAGACCAGCTTCCCGTGGCCGAAGCCATGCTGA 
CTTGCCGGCATAAGTTCCCTGATGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGT 
GGGTCTGGTTGT^GAOTCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTC^GCCTTACTGTC 
CCCTCCA(^TC^CCACCCTCCAGCTCGGGCCTGAGCCGAGACGCCACGGCCACCCCTCCCTCCTCCCCAT 
CTATGAGCAGCGCCCTGGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGT 
GGACTACTGGCTGGGCCACCCCGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAAC 
ACCCTCAAGAGTGTCTTCCGCTCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTTTCTG 
GC^CCATGGCCATGACTGTGGTCACCAAAGAAAAGAACAAGAAAGTTCCCACO 

ACCCCGAGAAAAGGAGGTGGATTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCCTCATCTGTTCTTCC 
CCCTCCTTAGGCCCCAGCGTGGGCCCAGACCCATCCT 

CTCCCTGT(^CCTCCCTCTCACCAACCCGGGGTCTGAGCCCCTCATTCCTGACCGTCCGTGTTCTCAGGA 
GTGGTTGAGGACACAGGGCCCC&GCCCAGCCCT^^ 

CCTTTGGAGCTTTTCTCTTGTCCTCTCACTCCTTCCCAGAAGTTTTTGCACAGT^ACTTCATTTTGAAAGT 
GTTTTTCTC^TTCTCCATACCTC^ 

GAGCGTCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACT^ 
ATAAATTGGTGATTTCTTACCGAC 

Human PACS-1 protein sequence - varl (public gi: 7022111) (SEQ ID NO: 362) 

MPRPVKVAAVGGQSYLSSILRFFVKSIiANMTSDWI^ 

RDLFSRSEPPVSEQIjDVAGRVMQYVNGAATTHQLPVAEAMIiTCRHKFPDEDSyQKFI PFI GWKVGLVED 
SPSTAGDGDDSPWSLTWSTSPPSSSGLSRDATATPPSSPSMSSAIiAIVGSPNSPYGDVIGLQVDYWIiG 

HPGERRREGDKRDASSKNTLKSVFRSVQVSRLPHSGEAQLSGTM^ 

VDSKSQVIEGISRLICSSPSLGPSLGPDPSSQPGFPPAGSFPPCHLPLTNPGSEPblPDRPCSQEWLRTQ 
GPSPALCTPQPGHLRPTAPIjELFSCPLTPSQKFIiHRTSF 

Human PACS-1 protein sequence - var2 (public gi: 6330231) (SEQ ID NO: 363) 
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AITRQPNIKQK^ALLKRFKVSDEVGFGLEHVSREQIREVEEDLDELYDSLEMYNPSDSGPEMEETES 
STPKPKLKPFFEGMSQSSSQTEIGSLNSKGSIiGKDTTSPMELAALEKIKSTWIKNQDDSLTETDTLEI 

QDMFGDASTSLWPEKV^PMKSSKTDLQGSASPSKVEGW^ 

RS PDLGHSTQ I PRKWYDQLNQILVSDAALPENVILWTTDWQGQ WAELLQDQRKPWCTCSTVEVQAV 

liSALLTRIQRYCNCNSSMPRPVKVAAVGGQSYLSSILRFFVKSLANKTSD 
- LGSVDS KYS S SFLDSGWRDLFSRSEPPVSEQLDVAGRVMQYWGAATTHQLPVAEAMLTCRHKFPDEDS Y 

QKFIPFIGVVKVGLVEDSPSTAGDGDDSF^ 
^ tfSPYGDVIGEQVDYWLGHPGERRREGDKRDAS^ 

NKKVPTI FLSKKPREKEVDSKSQVIEGI SRLI CSAKQQQTMLRVSIDGVEWSDIKFFQLAAQWPTHVKHF 

PVGLFSGSKAT , 

HumanPACS-1 protein sequence - var3 (public gi: 34420885) (SEQ ID NO: 364) 

MAERGGAGGGPGGAGGGSGQRGSGVAQSPQQPPPQQQQQQPPQQPTPPKIjAQATSSSSSTSAAAASSSSS 

S TSTSMAVAVASGS AP PGGPGPGRTP AP VQMNL YATWEVDRS S S S CV PRLFS LTLKKLVMLKEMDKDLNS 

WIAVKLQGSKRILRSNEIVLPASGLVETELQLTFSLQYPHFLKRDANKLQIMLQRR^^ 

LAVGLINMAEVMQHPNEGALVLGLHSNVKDVSVPVAE I KI YS LSSQP I DHEGI KS KLSDRS PD I DNYSEE 

EEESFSSEQEGSDDPLHGQDLFYEDEDLRKVKKTRRKLTSTSAITRQPNIKQKFVALLKRFKVSDEVGFG 

LEHVSREQIREVEEDLDELYDSLEMYNPSDSGPEMEETESILSTPKPKLKPFFEGMSQSSSQTEIGSLNS 

KGSLGKDTTSPMELAALEKIKSTWIKNQDDSLTETDTLEITDQDMFGDASTSLW 

QGSASPSKVEGVHTPRQKRSTPLKERQLSKPLSERTNSSDSERSPD1X3HSTQIPRKVVYDQLNQILVSDA 
ALPENVI LWTTDWQGQYVAELLQDQRKPVVCTCSTVEVQAVLSALLTRI QRYCNCNS SMPRPVKVAAVG 
GQS YLSS I LRFFVKSLANKTSDWLGYMRFLI I PLGSHPVAKYLGSVDSKYSSSFJjDSGWRDLFSRSEPPV 
SEQLDVAGRVMQYVNGAATTHQLPVAEAMLTCRHKFPDEDSYQKFI PFIGWKVGLVEDS PSTAGDGDDS 
PWSLTVPSTS PPS S SGLSRDATATP PS S PSMNSALAI VGS PNS PYGDVI GLQVD YWLGHPGERRREGDK 
RDASSKNTLKSVFRSVQVSRLPHSGEAQLSGTMAMTVVTKELNKKVPTI FLSKKPREKEVDSKSQVI EGI 
SRLICSAKQQQTMLRVSIDGVEWSDIKFFQLAAQWPTHVKHFPVGIiFSGSKAT 

'Hitman PACS-1 protein sequence- var4 (public gi: 33243995) (SEQ ID NO: 365) 

ES I LSTPKPKLKPFFEGMSQSSSQTEIGSLNS KGSLGKDTTSPMELAALEKI KSTWI KNQDDSLTETDTL 
EITDQDMFGDASTSLVVPEKVKTPMKSSKTDLQGSASPSKVEGVOT 

SDSERS PDLGHSTQI PRKWYDQLNQ I LVSDAALPENVI LVNTTDWQGQYVAELLQDQRKPVVCTCSTVE 
VQAVLSALLTRIQRYCNCNSSMPRPVKVAAVGGQS YIiSS I LRFFVKSLANKTSDWLGYMRFIiI I PLGSHP 

VAKYLGSVDS kyss s fldsgwrdlfsrseppvseqldvagrvmqyvngaatthqlpvaeamltcrhkfpd 

EDSYQKFIPFIGWKVGLVEDSPSTAGDGDDSPWSLTVPSTSPPSSSGLSRDATATPPSSPSMSSALAI 
VGSPNSPYGDVIGLQVDYWLGHPGERRREGDIO^ASSKNTLKSVFRSVQVSRLPHSGEAQLSGTMAMTVV 
TKEKiraKVPTIFLSKKPREKEVDSKSQVIEGISRLICSAKQQQTMLRVSIDGV^^ 
VKHFPVGLFSGSKAT 

Hmnan P ACS- 1 protein sequence - var5 (public gi: 3 0 962 84 s) (SEQ ID NO: 366) 

MAERGGAGGGPGGAGGGSGQRGSGVAQSPQQPPPQQQQQQPPQQPTPPKLAQATSSSSSTSAAAASSSSS 

STSTSMAVAVASGSAPPGGPGPGRTPAPVQMNLYATWEVDRSSSSCVTRLFSLTLKKL 

WI AVKLQGS KRI LRSNE I VLPASGLVETELQLTFS LQYPHFLKRDANKLQ I MLQRRKRYKNRT I LGYKT 

LAVGLINMAEVMQHPNEGALVLGLHSNVKDV^ 

EEESFSSEQEGSDDPLHGQDLFYEDEDLRXVKKTRRKLTSTSAITRQPNIKQKFVALLKRFKVSDEVGFG 
LEHVSREQIREVEEDLDELYDSLEMYNPSDSGPEMEETESILSTPKPKLKPFFEGMSQSSSQTEIGSLNS 
KGSLGKDTTSPMELAALEKI KSTW I KNQDDSLTETDTIJ I TDQDMFGDASTSLVVPEKVXTPMKSSKTDL 
QGSAS PS KVEGVHTPRQKRSTPLKERQLSKPLSERTNS SDSERS PDLGHSTQI PRKVVYDQLNQILVSDA 
ALPENVILVNTTDWQGQYVAELLQDQRKPWCTCSTVEVQAVLSALLTRIQRYCNCNSSMPRPVKVAA 
GQSYLSSILRFFVKSLANKTSDWLGYMRFLIIPLGSHPVAKYLGSVD^ 

SEQLDVAGRVMQYVNGAATTHQLPVAEAMLTCRHKFPDEDS YQKFI PF I GWKVGLVEDS PSTAGDGDDS 
PWSLTVPSTSPPSSSGLSRDATATPPSSPSMSSALAIVGSPNSPYGDVIGLQVDYWLGHPGERRREGDK 
RDASSKNTLKSVFRSVQVSRLPHSGEAQLSGTMAMTVVTKEKNKKVPTI^ 
SRLI CSAKQQQTMLRVS I DGVEWSDI KFFQLAAQWPTHVXHFPVGLFSGS KAT 

Unigene Names PPP1CA TJnigene ID; Hs. 183994 

Human PPP1 CA mRNA sequence - varl (public gi: 287796) (SEQ ID NO: 101) 

GCAAGGAGCTGCTGGCTGGACGGCX3GCATGTCCGACAGCGAGAAGCTCAACCTGGACTCGATCATCGGGC 
GCCTGGTGGAAGTGCAGGGCTCGCGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCX5CGGTCT 
GTGCCTGAAATCCCGGGAGATTTTTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATC 
TGCGGTGACATAC^CGGCCAGTACTACGACCTTCTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGA 
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GCAACTACCTCTTTCTGGGrGGACTATGTGGACAGGGGCAAGCAGTC . 
TGTGACCTGCTGTGGTCTGACCOTGACAAGGACGTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTT 

c^ggtggtagaagacggct 
SgtSLagttto^^ 

Stc^ccccacccccx^^^ 

TCGATTGATTGTACAGAAATCATGCTQCCATGCTGGGGGGGGGTCACCCCGACCCCT 

cacggggaacatggagccttggtgtatttttcttttcttttt^ 
cccagggctgcttcctgcctgcacctgcggt^^^ 
gcaacggcaggccaggtcgtgggtctcca^^ 
acccatctggtctcttgaataaaggtcaaagctggatcggaatc " 

Human PPP1CA iuRNA sequence - var2 (public gi: 21758300) (SEQ ID NO: 102) 

AAAAAAAAAAAAAGTTTTCCCTCCATGAG 

CCGCCACCTCCAGCGTCCTCGGCAGGGAGTTGTGGTGCCGTAGAGGGCGGTCCCCGCGGCCCACGCCGCA 

CACC^CCTGGGCAGGGGGAGACTCAGGGC^AGGCCCACACACTCCCTGCCCCCCACGCACAC^ 

CCTTGTGCCAAAATTCAGACCAGACCCCTCACTGGACATTCAAGAAGCCCCGTCCTCCAACGTGTCTTAA 

ATTGCACACGAGCTCTCCCTGCCACTCCCCATCTGGTCCCCAGACCTCTCCAGGGATTCTACCTACCCAG 

GCTTCCAGGCCCAGCTGGGGTCCCCCTCCAGGATGGCTCCTGCAGCCCTGGGGGCCTGGGCCACCCTGGT 

GTGCCCCACCCTAGCATCTCCCTGGGGCGCACCTTTCCCTACCCCACTGGAGCTCCCTGAGGGCAGGGTC 

GAATCTCTCCCTCTCAGTGTAGCCTAGAGCGGGGTACTCAGGAGGGTCCGTAAGCCTTCCTGACTCTCCA 

GCTTAGAGGCCCCTCCTGAAGGCGTCGAGGCACTAGAGGTTTATCAGGAGGCCCTGGGTCAGCCTCTACG 

'tgggcaagagctctctgggaagacggggaggtctaaggccagcacagagtggccagagggccacaccaaa 

CTCCCATCCCTGGTCAGCCCAGGTGGCTCTCACCTGAGCAGGGCAGCTGGGCAGGTGGGTACACAGCCTC 

CACCAGGACACTCTCCTCCTCCTCCAGCTTCTCCAGCAGCGCCAGCACTGTG 

TCTGCCCGCGGGTCCAGACGCCOTGCC^ 
GCAGCAAGGAGGGGAAAGGGGCCTCCTGGACACCAC^^ 

GGAGCCGTGCAGGGCTCGCGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCC 

TGAAATCCCGGGAGATTTTTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCT 

TGACATACACGGCCAGTACTACGACCTTCTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAAC 

TACCTCTTTCTGGGGGACTATGTGGACAGG^ 

ATAAGATCAAGTACCCCGAGAACTTCTTCCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCAT 

CTATGGTTTCTACGATGAGTGCAAGAGACGCTACAACATCAAACTGTGGAAAACCTTGACTGACTGCTTC 

AACTGCCTGCCCATCGCGGCCATAGTGGACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACC 

TGCAGTCTATGGAGCAGATTCGGCGGATCATGCGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGA 

CCTGCTGTGGTCTGACCCTGACAAGGACGTGCAGGGCIH3GGGCGAGAACGACCGTGGCGTCTCT 

TTTGGAGCCGAGGTGGTGGCCAAGTTCCTCCACAAGCACGACTTC 

TGGTAGAAGACGGCTACGAGTTCTTTGCGAAGCGGCAGCTGGTGACA 

TGGCGAGTTTGACAATGCTGGCGCCATGATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTC 

AAGCCCGCCGACAAGAAGAAGGGGAAGTACGGGCAGTT 

CCCCACCCCGCAATTCCGCCAAAGCCAAGA^ 

TGATTGTACAGAAATCATGCTGCCATGCTGGGGGGGGGTGA£ 

GGAACATGGAGCCTTGGTGTATTTTTCTTTTCTTTTTTTAATGAATCAATAGCA 

GGCTGCTTCCTGCCTGCACOTGCGGTGACTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGCAA 

CGGCAGGCGAGGTCGTGGGTCTCC^GCCGTGCT^ 
ATCTGGTCTCTTGAATAAAGGTCAAAGCTGGATTCTCGC 

Human PPPlCAmRNA sequence- var3 (public gi: 14124967) (SEQ ID NO: 103) 

GGCTGCCGGAGGGCGGGAGGCAGGAGCGGGCCAGGAGCTGCTGGGCTGGAGCGGCGGCGCCGCCATGTCC 
GAC^GCGAGAAGCTCAACCTGGACTCGATCATCGGGCGCCTGCTGGAAGTGCAGGGCTCGCGGCCTGGCA 
AGAATGTACAGCTGACAGAGAACGAGATCCX3CGGTCTGTGCCTGAAATCCCGGGAGATTTTTCTGAGCCA 
GCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGC^GTGACATACACGGCCAGTACTACGACCTT 
CTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGACTATGTGGA^ 
GGGGC^GC^GTCCTTGGAGAC(^TCTGCCTGCTGCTGGCCTATAAGATCAAGTACCCCGAGAACTTCTT 
CCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCATCTATGGTTTCTACGATGAGTGCAAGAGA 
CX3CTACAACATCAAACTGTGGAAAACCTTCACTGACTGCTTCAACT 

ACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGATTCGGCGGAT 
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CATCCGGCCCACAGATGTGCCTGACCAGGGCCTOCrGTC 

GTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCC^AGTTCC 

TCC^CAAGCACGACTTGGACCTC^TCTGCCGAGCACACCAGGTGGTAGAAGACGGCTA 

CAAGCGGCAGCTGGTGACACTTTTCTC^GCTCCC^CTACTGTGGCGAGTTTGACAATGCTGGCGCCATG 

ATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTCAAGCCCGCCGACAAGAACAAGGGGAAGT 

ACGGGC^GTTCAGTGGCCTGAACCCTGGAGGCCGACCCATCACCCCACCCCGCAATTCCGCCAAAGCCAA 

GAAATAGCCCCCGCACACC7VCCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCATGCTGCCATGC 

TGGGGGGGGGTCACCCCGACCCCTCAGGCCCACOTGTCACGGGGAACATGGAGCCTTGGTGTATTTTTCT 

TTTCITTTTTTAATGAATCAATAGCAGCGTCCAGTCCCCCAGGGCTGCTTCCTGCCTGCACCTGCGGTGA 

CTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGC^ 

GTGCTTGGCCTCAGGGCTGGCAGCCX3GATCCTGGGGCAACCCATCTGGTCTCTTGAATAAAGGTCAAAGC 
TGGATTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human PPP1CA mRNA sequence- var4 (public gi: 33872 852) (SEQ ID NO: 104) 

CCTCGTGCCGAATTCGGCACGAGGAGCGGGCCAGGAGCTGCTGGGCTGGAGCGGCGGCGCCGCCATGTCC 
GACAGCGAGAAGCTCAACCTGGACTCGATCATCGGGCGCCTGCTGGAAGTGCAGGGCTCGCGGCCTGGCA 

AGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCCT^ 

GCCCATTCTTCTGGAGCTGGAGGC^CCCCTCAAGATCTGCGGTGACATAGACGGCCAGTACTACGACCTT 
CTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGACTATGTGGACA 

GGGGCAAGC^GTCCTTGGAGACC^TCTGCCTGCTGCTGGC^^ 
CCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCATCTATGGTTTCTAC 

CGCTACAACATCAAACTGTGGAAAACCTTCACTGACTGCTTCAACTGCCTGCCCATCGCGGCCATAGTGG 
ACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGATTCGGCGGAT 
CATGCGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGACCTGCTGTGGTCTGACCCTGACAAGGAC 
GTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCCAAGTTCC 
TCCACAAGCACGACTTGGACCTCATCTGCCGAGCACACCAGGTGGTAGAAGACGGCTACGAGTTCTTTGC 

• £AAG03GCAGCTGGTGACACTTTTCTC^GCTCCCAACT^ 
ATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTCAAGCCCGCCGACAAGAAC^GGGGAAGT 

ACGGGCAGTTCAGTGGCCTGAACCCTGGAGGCCGACCCATCACCCCACCCCGCAATTCCGCCAAAGCCAA 
GAAATAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCATGCTGCCATGC 
TGGGGGGGGGGTCACCCCGACCCCTCAGGCCCACCTGTCACGGGGAACATGGAGCCTTGGTGTATTTTTC 
TTTTCTTTTTTTAATGAATCAATAGCAGCGTCCAGTCCCCCAGGGCTGCTTCCTGCCTGCACCTGCGGTG 
ACTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGCAACGGC^GCCAGGTCGTGGGTCTCCAGC 
CGTGCTTGGCCTCAGGGCTGGCAGCCGGATCCTGGGGCAACCC^TCTGGTCTCTTGAATAAAGGTCAAAG 
CTGGATTCTCGAAAAAAAAAAAAAAAAAA 

Human PPP 1 CA mRNA sequence - var5 (public gi: 12804878) (SEQ ID NO: 105) 

CAGGAGCGGGCCAGGAGCTGCTGGGCTGGAGCGGCGGCGCCGCCATGTCCGACAGCGAGAAGCTCAACCT 
GGACTCGATCATCGGGCGCCTGCTGGAAGTGCAGGGCTCGCGGCCTC 

AACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTTTTCTGAGCCAGCCCATTCTTCTGGAGCTGG 
AGGCACCCCTCAAGATCTGCGGTGACATACACGGCCAGTACTACGACCTTCTGCGACTATTTGAGTATGG 
CGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGACTATGTGGACAGGGGCAAGCAGTCCTTGGAG 
ACCATCTGCCTGCTGCTGGCCTATAAGATCAAGTACCCCGAGAACTTCTTCCTGCTCCGTGGGAACCACG 
AGTGTGCCAGCATCAACCGCATCTATGGTTTCTACGATGAGTGCAAGAGACGCTACAACATCAAACTGTG 
GAAAACCTTCACTGACTGCTTCAACTGCCTGCCCATCGCGGCCATAGTGGACGAAAAGATCTTCTGCTGC 
CACGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGATTCGGCGGATCATGCGGCCCACAGATGTGC 
CTGACCAGGGCCTGCTGTGTGACCTGCTGTGGTCTGACCCTGACAAGGACGTGCAGGGCTGGGGCGAGAA 
CGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCCAAGTTCCTCCACAAGCACGACTTGGAC 
CTCATCTGCCGAGCACACCAGGTGGTAGAAGACGGCTACGAGTTCT 

TTTTCTCAGCTCCCAACTACTGTGGCGAGTTTGACAATGCTGGCGCCATGATGAGTGTGGACGAGACCCT 
CATGTGCTCTTTCCAGATCCTCAAGCCCGCCGACAAGAACAAGGGGAAGTACGGGCAGTTCAGTGGCCTG 
AACCCTGGAGGCCGACCC^TCACCCCACCCCGC^T^ 

CCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCATGCTGCCATGCTGGGGGGGGGTCACCCCGAC 

CCCTCAGGCCGACCTGTCACGGGGAACATGGAG 

ATAGCAGCGTCCAGTCCCCCAGGGCTGCTTCCTGCCT^^ 

CCGAGGCTGGAGCTCAGGGCAACGGCAGGCCAGGTCGTGGGTCTCCAGCCGTGCTTGGCCTCAGGGCTGG 

CAGCCGGATCCTGGGGCAACCCATCTGGTCTCTTGAATAAAGGTCAAAGCTGGATTCTCA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human PPPlCAmJRNA sequence - var6 (public gi: 34534606) (SEQ ID NO: 106) 

CTTCTTGCTGACGCCGCCAGCGCCGACCACCAGCAGCTGTTTTCCCTCCATGAGGCAGCGCGCCGACCGC 
CGAAGCATGGTCTCCACCAGCGGCGCCGCCACCGCCTCGTCGGCCGCCGGCCCCAGCCGCGGCGCGGCCC 
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ArAGC CCCTCC^CGCGCCGCACGCCTC 
cScCC^GGTACTGCAGGGTGGGGCACTTCCGCCACAGG 

TOTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTTTTCT 
AOTCTTCTGQAGCTGOAQQCACC^ 

GACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGACTATG 

' CAAGCAGTCCTTGGAGACCATCTGCCTGCTGCT^^ 
CTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCATCTATGGTTTCTACGATGAGTGCAAGAGACGCT 

ACAACATCAAACTGTGGAAAACCTTCACTGACTGCTTCAACTGCCTGCCCATCGCGGCCATAGTGGACGA 

AAAGATCTTCTGCTGCC^CGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGATTCGGCGGATCATG 

CGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGACCTGCTGTGGTCTGACCCTGACAAGGACGTGC 

AGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCCAAGTTCCTCCA 

CAAGCACGACTTGGACOTCATCTGCCGAGCACACCAGGTGGTAGAA^ 

CGGCAGCTGGTGGCACTTTTCTCAGCTCCCAACTACTGTGGCGAGTTTGACAATGCTGGCGCCATGATGA 

GTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTCAAGCCCGCCGACAAGAACAAGGGGAAGTACGG 

GCAGTTCAGTGGCCTGAACCCTGGAGGCTGACCCATCACCCCACCCCGCAATTCCGCCAAAGCCA^ 

TAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCATGCTGCCATGCTGGG 

GGGGGGTCACCCCGACCCCTCAGGCCCACCTGTCACGGGGAACATGGAGCCTTGGTGTATTTTTCTTTTC 

TTTTTTTAATGAATCAATAGCAGCGTCCAGTCCCCCAGGGCTGCTTCCTGCCTGCACCTGCGGTGACTGT 

GAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGCAACGGCAGGCCAGGTCGTGGGTCTCCAGCCGTGC 

TTGGCCTCAGGGCTGGCAGCCGGATCCTGGGGCAACCCATCTGGTCTCTTGAATAAAGGTCAAAGCTGGA 

TTCTC 

muTianPPPlCAmRNAsequen.ee- var7 (public gi: 3 0582096) (SEQ ID NO: 107) 

ATGTCCGACAGCGAGAAGCTCAACCTGGACTCGATCATCGGGCGCCTGCTGGAAGTGCAGGGCTCGCGGC 
CTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTTTTCT 

GAGCCAGCCCATTCTTCTGGAGCTGGAGG 

GACCTTCTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGACTATG 
TGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGCT 

CTTCTTCCTGCTCCGTGGGAAC(^CGAGTGTGCCAGCATCAACCGCATCTATGGTTTCTACGATGAGTGC 
AAGAGACGCTACAACATCAAACTGTGGAAAACCTTCACTGACTGCTTCAACTGCCTGCCCATCGCGGCCA 

TAGTGGACGAAAAGATCTTCTGCTGCCACGGA<^ 

GCGGATCATGCGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGACCTGCTGTGGTCTGACCCTGAC 
AAGGACGTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCCA 

AGTTCCTCCACAAGGACGACTTGGACCTCATCTGCCG 

CTTTGCCAAGCGGCAGCTGGTGACACTTTTCTCAGCTCCCAACTACTGTGGCGAGTTTGACAATGCTGGC 
GCCATGATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTCAAGCCCGCCGACAAGAACAAGG 
GGAAGTACGGGCAGTTCAGTGGCCTGAACCCTGGAGGCCGACCCATCACCCCACCCCGCAATTCCGCCAA 

AGCCAAGAAATAG 

Human PPP1CA mRNA sequence - var8 (public gi: xsosxs) (SEQ ID NO: 108) 

GGGCAAGGAGCTGCTGGCTGGACGGCGGCATGTCCGACAGCGAGAAGCTCAACCTGGACTCGATCATCGG 
GCGCCTGCTGGAAGTGCAGGGCTCGCGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGT 

CTGTGCCTGAAATCCCGGGAGATTTTTCTGAGCCAGCC 

TCTGCGGTGACATACACGGCCAGTACTACGACCTTCTGC^ACTATTTGAGTATGGCGGTTTCCCTCCCXSA 

GAGCAACTACCTCTTTCTGGGGGACTATGTGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGCTG 

CTGGCCTATAAGATCAAGTACCCCGAGAACTTCTTCCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCA 

ACCGCATCTATGGTTTCTACGATGAGTGCAAGAGACGCTACAACATCAAACTGTGGAAAA 

CTGCTTCAACTGCCTGCCCATCGCGGCCATAGTGGACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCC 

CCGGACCTGCAGTCTATGGAGCAGATTCGGCGGATC^^ 

TGTGTGACCTGCTGTGGTCTGACCCTGACAAGGACGTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTC 

TTTTACCTTTGGAGCCGAGGTGGTGGCCAAGTTCCTCCACAAGCACGACTTGGACOT 

CACCAGGTGGTAGAAGACGGCTATGAGTTCTTTGCCAAGCGGCAGCTGGTGACACTTTTCTCAGCTCCCA 

ACTACTGTGGCGAGTTTGACAATGCTGGCGCCATGATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCA 

GATCCTCAAGCCCGCCGACAAGAACAAGGGGAAGTACGGGCAGTTCAGTGGCCTGAACCCTG 

CCCATC^CCCCACCCCGCAATTCCGCCAAAGCCAAGAAATAGCCCCCGCACACCA 

GATGGATTGATTGTACZAGAAATCATGCTGCCATGCTGGGGGGGGGTCACCCCGACCCCTAAGGCCCACCT 
GTCACGGGGAACATGGAGCCTTGGTGTATTTTTCTTTTCTTTTTTTAATGAATCAATAGCAGCGT 

CCCCC^GGGCTGCTTCCTGCCTGCACCTGCGGTACT 

GGGCAACGGCAGGCCAGGTCX3TGGGTCTCGAGCCGTGC 

CAACCCATCTGGTCTCTTGAATAAAGGTCAAAGCTGG 
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Human PPPlCAmRNA sequence- var9 (publicgi:.i90280)(SEQIDNO: 109) 

CGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTT 

TTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGCGGTGACATACACGGCCAGTA 

CTACGACCTTCTGCGACTATTTGAGTATGGAGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGAC 

TATGTGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGCTGCTGGCCTATAAGATCAAGTACCCCG 

AGAACTTCTTCCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCATCTATGGTTTCTACGATGA 

GTGCAAGAGACGCTACAACATCAAACTGTGGAAAACCTTCACTGACTGCTTCAACTGCCTGCCCATCGCG 

GCCATAGTGGACGAAAAGATCTTCTGCTGCC^CGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGA 

TTCGGCGGATCATGCGGCCCAC^GATGTGCCTGACCAGGGCCTGCTGTGTGACCTGCTGTGGTCTGACCC 

TGACAAGGACGTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTG 

GCCAAGTTCCTCCACAAGCACGACTTGGACCTCA.TCTGCCGAGCACACCAGGTGGTAGAAGACGGCTACG 

AGTTCTTTGCCAAGCGGCAGCTGGTGACACTTTTCTC 

TGGCGCCATGATGAGTGTGGACGAGACCCTCATGTGCTC^^ 

AAGGGGAAGTACGGGC^GTTCAGTGGCCTGAACCCTGGAGGCCGACCGATCACCCC^ 
CCAAAGCCAAGAAATAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCAT 
GCTGCCATGCTGGGGGGGGGTCACCCCGACCCCTCAGGCCCACCTGTCACGGGGAACATGGACCTTGGTG 
TATTTTTCTTTTCTTTTTTTAATGAATCAG 

Human PPP1CA protein sequence - varl (public gi: 298964) (SEQ ID NO: 261) 

MSDSEKLNLDS I IGRLLEGSRVLTPHCAPVQGSRPGKNVQLTENE IRGLGLKSRE I FLSQPI LLELEAPL 
KI CGDI HGQYYDLLRLFE YGGFPPE SN YLFLGD YVDRGKQS LET I CLLLAYKI KYPENFFLLRGNHECAS 
INRIYGFYDECKRRYNIKLWKTFTDCFNCLPIAAIVDEKIFCCTGGLSPD 

LLCDLLWSDPDKDVQGWGENDRGVS FTFGAE WAKFLHKHDLDLI CRAHQWEDGYEFFAKRQL 
PNYCGEFDNAGAMMSVDETLMCS FQ I LKPADKNKGKYGQFSGLNPGGRP I TP PRNSAKAKK 

Human PPP1CA protein sequence - var2 (pubHcgi: 190 516) (SEQ ID NO: 262) 

MSDSEKLNLDSIIGRLLEVQGSRPGKNVQLTENEIRGLCLKSREIFLSQPILLEL^^ 

DLLRLFEYGGFPPESNYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHEC^im 

KRRYNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQS^QIRRIMRP^^ 

KDVQGWGENDRGVS FTFGAEVVAKFLHKHDLDLI crahqwedgyeffakrqlvtlfs apnycge fdnag 

ammsvdetlmcsfqilkpadknkgkygqfsglnpggrpitpprnsakakk 

Human PPP1CA protein sequence - var3 (public gi: 190281) (SEQ ID NO: 263) 

rpgknvqlteneirglclksreiflsqpilleleaplkicgdihgqyydllrlfeyggfppesnylflgd 

yvdrgkqsleticlli^ykikypenffllrgnhecasii^iygfydeckrryniklwktftdcfnclpia 

aivdekifcchgglspdlqsmeqirrimrptdvpdqgllcdllwsdpdkdvqgwgendrgvsftfgaew 

akflhkhdldli crahqwedgye ffakrqlvtlfs apnycge fdnagammsvdetlmcs fq i lkpadkn 

kgkygqfsglnpggrpitpprnsakakk 

Human PPP1CA protein sequence - (public gi: 35451) (SEQ ID NO: 395) 

MSDSEKLNLDS I IGRLLEVQGSRPGKNVQLTENEIRGLCLKSREI FLSQPILLELEAPLKICGDIHGQYY 
DLLRLFEYGGFPPESNYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHEC^INRIYGFYDEC 
KRRYNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDIiQSMEQIRRIMRPTDVPDQGLLCDLLWSDPD 
•KDVQGWGENDRGVSFTFGAEWAKFLHKHDLDLICRAHQWEDGYEFFAKRQLVTLFSAPNYCGEFDNAG 
AMMSVDETLMCS FQILKPADKNKGICYGQ FSGLNPGGRP I TPPRNSAKAKK 



Human PPP1CA pray sequence - varl (SEQ ID NO: 1 10) 

CCGCCTGGTNCTACCCATGACNCACNTACCANTATTACGTCT^ 
ATTCCACACAGAATACAAGTGCGCTCAT 

TAGGGCNAGNGCCCCCNTGGANTTCCNNTACAACNTNCCAGGATNACGOT 
TGAATTCCACCGAAGCGGTGGTATCAACGCAGAGTC 

GCCGACCCATCACCCCACCCOTCAATTCCGCCAAAGCCAAGAAATAGNNGGCGCACACCACCCTGTGCCT 
TNNATGATGGATTGATTGTACAGAAATCATGCTGCCATGCTGGGGGGGGG 

Unigene Name; PRKAR1A Unigene IDs Hs . 28 0342 

Human PRECAR1 A mRNA sequence - varl (public gi: 34530409) (SEQ ID NO: 111) 

ATCGCAGAGTGGAGCGGGGCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGCCGCCTCGCACCCGCA 
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GCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATGGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGC 

ACGCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAGCATAAG 
GTGCAGTTGTGCACTGCTOGACCTGAGAGACCCATGG 

Laac^ttcagaatctg^ 

CC^TCAACCCAGTGGT^ 

ATGCGGCATCCTATGTTAGAAAGGTTATACC^AAAGATT^ 

TGAAAAGAATGTG CTGTTTTCACAT CTTGATGATAATGAGAGAAGTGATATTTTTC^TGCCATGTTTTCG 
GTCTCCTTTATCGCAGGAGAGACTGTGATTCAGCAAGGTGATGAAGGGGATAACTTCTATGTGATTGATC 
AAGGAGAGACGGATGTCTATGTTAACAATGAATGGGC^CCAGTGTTGGGGAAGGAGGGAGCTTTOTAGA 

ACTTGCXTXGATO ^^ 

ATCGACCGAGACAGCXATAGAAGAATCCTCATGGGAAGCACACTGAGAAAGCGGAAGATGTATGAGGAAT 
TCCTTAGTAAAGTCTCTATTTTAGAGTCTCTGGACAAGTGGGAACGTCTTACGGTAGCTGATGCATTGGA 
ACCAGTGCAGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGATGAGTTCTTCATTATT 
TTAGAGGGGTCAGCTGCTGTGCTACAACGTCGGTCAGAAAATGAAGAGTTTGTTGAAGTGGGAAGATTGG 

GGCCCTCTGATTATTTTGG^^ 

TGGCCCCTTGAAGTGCGTTAAGCTGGACCGACCTAGATTTGAACGTGTTCTTGGCCCATGCTCAGACATC 

CTCAAACGAAACATCCAGCAGTACAACAGTTTTGTG 
CCTTTTCTCCTCTCCCCAATCCATGCT 

GTGGCCACTGGCATCGCAGCTTCCTGTCTGTTTATATATTGAAAGTTGCTTTTATTGCACCATTTTCAAT 
TTGGAGCATTAACTAAATGCTCATACACAGTTAAATAAATAGAAAGAGTTCTATGGAGACTTTGCTGTTA 

CTGCTTCTCTTTGTGCAGTGTTAGTATTCACCCTGGGCAGTGAGTGCCATGCTTTTTO 
CCCAGCACCTATTGAATTACCATAGAGTAATGATGTAACAGTGCAAGATTTTTTTTTTTAAGTGACATAA 
TTGTCCAGTTATAAGCGTATTTAGACTGTGGCCATATATGCTGTATTTCTTTGTAGAATAAATGGTTTCT 
CATTAAACTCTAAAGATTAGGGAAAATGGATATAGAAAATCraAGTATAGTAGAAAGACATCTGCCTGTA 
ATTAAACTAGTTTAAGGGTGGAAAAATGCCCATTTTTGCTAATTATCAATGGGATATGATTGGTTCAGTT 
TTTTTTTTTTTCCAGAGTTGTTGTTTGCCAAGCTAATCTGCCTGGTTTTATTTATATCTTGTTATTAATG 
TTTCTTCTCCAATTCTGAAATACTTTTGAGTATGGCTATCTATACCTG^ 
'CATAGATTGCAAATATTGGTTAGTATTTAACTACAT 

TCCAGCTAGTGCCAAATAATTGATGAGATGCTGAAOTGAGAATAAGAATTTGAGGTCTACATTCTTGGTT 

GTTAATTTAGAGCGTTTGGTTAAAGTATGTCCTTCAGCTGACTCCAGTATAATCrCCTCTGCTCATTAAA 

CTGATTCCAGGAGATTGGATTTGCTGTGACTAGATACAGATGGAGCAAATGTCCTAACAGAGAAATAGAG 

GTGATGCTGCTAAAGGGAGAAATGCCAGGCGGACAAAGTTCAGTGTCGGGAATTTTCCCCGTGACATTCA 

CTGGGGCATGAGATTTTGGAAGAAGTTTTTTACTTTGGTTTAGTCTTTTTTTCCTTCCTTTTTATTCAGC 

TAGAATTTCTGGTGGGTTGATGGTAGGGTATAATGTGTCTGTGTTGCTTCAAATTGGT 

CCTGCTGAAAGTCCTGCTTTCCTATCTAGCATT^ 

AAGTAAACTTGTGTATTGAGTCTTAACTGTATTTCAGTATTTTCCAGCCTTATGTGTTACATTATTCCAA 

TGATACCCAACAGTTTATTTTTATTATTTTTTTAAACAAAATTTCACAGTTCTGTAA 

ATTTTCATTGTGATTTATATATAAGGTAATGTAGGGTTATATTTGGGAGTGACTGCAAGCATTTTT 

CTGTGTGCAACTAACTGACTCTGTTATTGATCCCTTCTCCTGCCCTTTCCCAGGTAATTTAAATTGGTCA 

TGGTAGATTTTTTTCATAGATTTGAAAAACTTTTAGGTTGTTACC^GTATGAAGTATAAATCrGGGG^ 

GAGGTTTTATTTACATTTTAGGGTGGGTAAGAAAGCCACCTTGTTACAAATTTTTTAATTTCCAAAAT^ 

TCTATATTAAATGAGGGTTTCTGATCTGTACTTTGTGTTTAGCTACCTTTTTATATTTAAAAAATTAAAA 

ATGAAAATTATGTTCTTACAAGCTTAAAGCTTGATTTGATCTTTGTTTAAATGCCAAAATGTACTTAAAT 

GAGTTACTTAGAATGCCATAAAATTGCAGTTT C ATGTATGTAT ATAAT CATGCT CATGTATATTTAGTT A 

CGTATAATGCTTTCTGAGTGAGTTTTACTCTTAAATCATTTGGTTAAATCATTTGGCTTGCTGTTTACTC 

CCTTCTGTAGTTTTTAATTAAAAGCTTTAAAGATAAGTCTAC^TTAAACAATGATC^CATCTAAAGCTTT 

ATCTTTGTGTAATCTAAGTATATGTGAGAAATCAGAATTGGCZATAATTTGTCTTAGTTGATATTCAAGGC 

TTTAAAAGTCATTATTCCTGGGCTTGGTAAGTGAATTTATGAGATTTACTGCTCTAGAAAGTATAGATGG 

CCAAAGGACCX3TTTTGTATTGCTTCCTGATTACCAGTCTGATTATACCATGTGTGCTAATATACTTTTTT 

TGTTATAGATTGTCTTAATGGTAGGTCAAGTAATAAAAAGAGATGAAATAATTT 

Human PRKAR1 A mRNA sequence - var2 (public gi: 4 8 8427 9) (SEQ ID NO: 1 12) 

TATTTTCCAGCCTTATGTGTTACATTATTC 

AAAATTTCACAGTTCTGTAATGTAGGCACTTTTATTTTCATTGTGATTTATATATAAGGTAATGTAGGGT 

TATATTTGGGAGTGACTGCAAGCATTTTTCCATCTGTGTGCAACT 
TCCTGCCCTTTCCCAGGTAATTTAAATTGGTCATC 

TTGTTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTATTTACATTTTAGGGTGGGTAAGAAAGCC 
ACCnTGTTACAAATTTTTTAATTTC 

TTTAGCTACCTTTTTATATTTAAAAAATTAAAAATGAA^ 

GATCTTTGTTTAAATGCCAAAATGTACTTAAATGAGTTACTTAGAATGCC^ 

ATGTATATAATCATGCTCATGTATATTTAGTTACGTATAATGCTTTCTGAGTGAGTTTTACT 

ATTTGGTTAAATCATTTGGCTTGCTGTTTACTC^ 

TCTACATTAAACAATGATCACATCTAAAGCTTTATCTTTGTGTAATCTAAGTATATGTGAGAAATCAG^ 
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TTGGCATAATTTGTCTTAOTT^TATTO^GGCTTTA^ 

CTGATTATRCCATGTGTGCTAATATACTTTTTTTGTTATAGAT^ 
AAGAGATGAAATAATTTAAAAAAAAAAAAA 

"Human PRKARlA rnRNA sequence - var3 (public gi: 33 636720) (SEQ ID NO: 113) 
ag?aSgS^ 

AAGCGCTGCTCAAAGATTCTATTGTGC^GTTGTGCACTGCTCGAGCTGAGAGACCCATGGCIATTCCTCAG 



GTGCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGGCATCCTATGTTA 

Sacaagacaatg^ 

GAGAGAAGTGATATOT^TO 

gSgaaggg^^^^ 

AACCAGTGOTGGGGAAGGAGGGAGCTTTGGAGAACTTGCTTTGATTTATGGAACACCGAGAG 
G^CAAA^CAAAGACAAATGTGAAA^ 

GCACACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTAGTAAAGTCTCTATTTTAGAGTCTCTGGACAA 
GTGGGAACGTCTTACGGTAGCTGATGCATTGGAA^ 

cagggaSc^ 

gaatcgtcctcgtgctgccaca^^ 
SJgaacgStcttc^ 

SSg^c^Sgaaatctgcctcctgtgcctcccttttctcct^ 
aa^ctgottattttccctacttg^ 

a^tgaaagttgcttttattgcaccattttcaatttggagcattaactaa 
SaSgIgtStggagact 

CAGTGAGTGCCATGCTTTTTGGTGAGGGCA 

ACAGTCCAAGATTTTTTTTTTAAGTGACATAATTGTCCAGTTATAAGCGTATT^ 
TOCTGTATTTCTTTG^AGAATAAATGGTTTCTCATTAAA^CTCTAAAGAT^ 

atcttagtatagtagaaagacatctgcctgtaattaaactagtttaagggtggaaaaatgcccatttttg 
ctaattatcaatgggatatgattggttcagtttttttttttccagagttgttgtttgccaagctaatctg 
cctggttttatttatatcttgttattaatgtttcttctccaattctgaaatacttttgag^tggctotc 

Sacctgccttttaagtttg^ 

CTCGGCTCAGAAATTCCGATTAGACCTTTATCCAGCTA 

AATAAGAATTTGAGGTCTACATTCTTGGTTGTTAATTTAGAGCGTTTGGTTAAAGTATGTCCTTCAGCTG 
ACTCCAGTATAATCTCCTCTGCTCATTAAACTGATTCCAGGAGATTGGATTTC 

TGGAGCAAATGTCCTAACAGAGAAATAGAGGTGATGCTOCTAAAGGGAGAAATGCCAGGCGGACAAAGTT 

CAGTGTCGGGAATTTTCCCCGTGACATTCACTGGGGCATGAGATTTTGGAAGAAGTTTTTTAC^ 

TAGTCTTTTTTTCCTTCCTTTTTATTCAGCTAGAATTTCTGGTGGGTTGATGGTAGGGTATAATGTGTCT 

gtgttgcttcaaattggtctgaaaggctatcctgcggaaagtcctgctttcctatctagcatttatctct 
ctggcaaacttttctttcttttcttttttaaagtaaacttgtgtattgagtcttaactgtatttcagtat 
tttccagccttatgtgttacattattccaatgatacccaac^gtttatttttattattttct^ 

ATTTCACAGTTCTGTAATGTAGGCACTTTTATO^ 

ATTTGGGAGTGACTGC^GCATTTTTCCATCTGTGTGCAACTAACTGACTCTGTTATTGATCCCTTCTCC 
TGCCCTTTCCCAGGTAATTTAAATTGGTCATGGTAGATTTTTTTCATAGATTTGAAAAACTTTTAGGTTG 
TTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTATTTACATTTTAGGGTGGGTAAGAAAGCCACC 
TTGTTACAAATTTTTTAATTTCCAAAATAATCTATATTAAATGAGGGTTTCTGATCTGTACTTTGTGTTT 

AGCTACCTTTTTATATTTAAAAAATTAAAAATGAAAATTACGTTCTTACAAGCTTAAAGCTTG 

C1TTGTTTAAATGCCAAAATGTACTTAAATGAGTTACTTAGAATGCCATAAAATTGCAGTTTCATGTATG 

TATATAATCATGCTCATGTATATTTAGTTACGTATAATGCTTTCTGAGTGAGTTTTACTCTTAAATCATT 

TGGTTAAATCATTTGGCn?TGCTGTTTACTCCCTTCTGTAGTTTTTAATTAAAAACTTTAAAGATAAGTCT 

ACATTAAACAATGATCACATCTAAAGCTTTATCTTTGTGTAATCTAAGTATATGTGAGAAATCAGAATTG 

GCATAATTTGTCTTAGTTGATATTCAAGGCTTTAAAAGTCATTATTCCTGGGCTTGGTAAGTGAATTTAT 

GAGATTTACTGCTCTAGAAAGTATAGATGGCGAAAGGACCGTTTTGTATTGCTTCCTGATTACCAGTCTG 

ATTATACCATGTGTGCTAATATACTTTTTTTGTTATAGATTGTCTTAATGGTAGGTCAAGTAATAAAAAG 

AGATGAAATAATTTAAAAAAAAAAAAAAAA 

Human PRICAR1 A niRNA sequence - var4 (public gi: 1526989) (SEQ ID NO: 114) 

GCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCG 
CCCGTCCCCAGAGAACCATGGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATG 
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TGAGCTCTACGTCCAGAAGCATAACATTCAAGCGCTC 



ATOTOCTGTTTTC^ 

tatcgcaggagac 
gtcagctgctgtgctacaacgtcggtcagaaaatgaagagt^ 

GaSa^GGTGAAATTO^^ 

?5aIctgcg™gctggaccgacctagatttgaacgtgttcttggcccatgctcagacatcctc 

AAACATCC^CAGTACAACAGTTTTGTGTCA 

^cicCC^ckTGCTTCACTCATGCAAACTGCTTTATTTTCCCTACTTGCAGCGCCAAGTGGCCAC 
TGGCATCGCAGCTTCCTGTCTGTTTATATAT^ 



TTAAX^AAAXGCTCATM 



CTI 

CTAAAGA^AGGGAAATGGATATAGAAAATCTTAGTATAGTAGAAAGAC^TCTGCCTGTA^ 

Sagttgttgtttck:^ 

ctgaaatact^tgagtatggctatctatacctgccttttaagtttgaaactaac^ • 

SgG^AGtSStLctA^ 

ttggctaaagtatgtccttcagctga^ 
JgS^gtgact^^^ 

■^GGAAG^SgTT^CTTTA^ 

cStcctmctagc^^ 
^gIot^a^S^ 

tacaagcttaaagcttgatttgatct 

Human PRKAR1 A mRNA sequence - var5 (public gi: 1525988) (^Q ^0:115) 

ctcgcgcccgccgccgcccgtccccagacsaaccatggagtctggcagtaccgccgc^^ 

gSgcScgagaatgtgagctctacgtccagaagcataacattcaagc^ 
gcagttgtgcactgctcgacctgagagacccatggcattcctcagggaatactttgagaggttggagaag 

ctcctcctccacccaacccagtggttaaaggtaggaggcgacgaggtgctatcagcgctgagg^acac 
aaJgccattgaaaagaatgtgctc™ 

tgttttcggtctcctttatcccaggagagactgtgattcagcarggtgatgaaggggataaot 

gattgatcaaggagagacggatgtctatgttaacaatga^^^ 
tttggagaacttgctttgatttatggaacaccgagagcagccactgtcaaagc^ 

tgtggggcatcgaccgagacagctatagaagaatcctcatggga^^ 

tgaggaattccttagtaaagtctctaxtttagagtctctggacaagtgggaacgtcttacggtagctgat 
gcattggaaccagtgcagtttgaagatgggcagaagattgtggtgcagggagaaccaggggatgag^ct 
tcattattttagaggggtcagc^ctgtgctacaacgtcggtcagaaaatgaac^gtttgttgaagtggg 

AAGATTGGGGCCTTCTGATTATTTTGGTGAAATTGCACra 
GTTCCTCGTGGCCCCTTGAAGTGCGTTAAGCTGGACCGACCTAG 

CAGACATCCTCAAACGAAACATCCAGCAGTACAACAGTTTTGTGTCACTGTCTGTCTGAAATCTGCC 
TGTGCCTCCCTTTTCTCCTCTCCCCAATCCATGCTTCACTCATGCAAACTGCTT^ 
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AGCGCCAAGTGGCCACTGGCATCGCAGCTTCCTGTOT 

TTTCAATTTGGAGCATTAACTAAATGCTCATAC^ 

AAAAAA 

Human PRKAR1A mRNA sequence - var6 (public gi: 9956010) (SEQ ID NO: 116) 

AACTGACTCTGTTATTGATCCCTTCTCCTGCCCTTTCCCAGGTAATTTAAATTGGTCATGGTAGATTTTT 

TTCATAGATTTGAAAAACTTTTAGGTTGTTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTATTT 

ACATTTTAGGGTGGGTAAGAAPlGCCACCTTGTTACAAATTTTTTAATTTCCAAAATAAT . 

GAGGGTTTCTGATCTGTACTTTGTGTTTAGCTACCTTTTTATATTTAAAAAATTAA 

TTCTTACAAGCTTAAAGCTTGATTTGATCTTTGTTC^ 

ATGCCATAAAATTGCAGTTTC^TGTATGTAT^^ 

TCTGAGTGAGTTTTACTCTTAAATCATTTGGTTAAAT 

TTTAATTAAAAACTTTAAAGATAAGTCTA^^ 

TCTAAGTATATGTGAGAAATCAGAATTGGC^^ 

TATTCCTGGGCTTGGTAAGTGAATTTATGAGATTTACTGCTCTAGAAAGTATAGATGGCCAAAGGACCGT 
TTTGTATTGCTTCCTGATTACCAGTCTGATTATACCATGTGTGCTAATATACTTTTTTTGTTATAGATTG 
TCTTAATGGTAGGTCAAGTAATAAAAAGAGATGAAATAATTTAAArTCTTAAATGAATCAGTTTTTCTTC 

CCTTTCTCCTTTCCGTCITTCCTCrCTCT 

GGGGGAAAGTGAATTATGGGATCGGTGTTTTGAAAGAGCAATGTTTATTTTCAGTGCTTTTCAGTTTGTC 
ATU^GAGTGGATCTCAAAATCTTGCTTAAAGGGTAATTGAGATGTAGCAGATTTATTTACTTAGTCATGGA 
AAGAAAAAAATTCAGTCAAAAGCTAAAGATTTCCTTTTGATTGAAGACAGATTGGTTCTGTGGCCTTGGA 
ACTTTCCCAGACTTAATGGGGAAACATCATTTCTAGATTAGCATACTCTTTGGTTTAAATTTAATATATA 
CATTTAATGTTACTTAGGGATACTTTTATATTTTGCATATATAAAGCCTCATATATAAAGCCTTATTTCT 
GATGCTCTTAGATTTCTGAGGAGTGAGATGATTAAGTTGTATTCATTAGTGTATTGGTATTTCTTCACAT 
CCAGTGAAATTGGAGATATGTTGTATGTTAGAAGAGCATTCTTTAAATTGTGTTGCTTTGAACATGTGTA 
CCTTTTCTAGATTCAGTAATCCCTTCCCCCCGTCCTCTGGAGTATGAAACCTTTAGAGTCACAATAAAAT 
GTAACTAAAGAAAAAAAAAAAAAAAAAAAAAAAAA 

Human PRKAR1 A mRNA sequence - var7 (public gi: 2 17^7 J5S) (SEQ* ID NO: 117) 

TAATTTTCTTGTGTGTTTTTAAAAATTTTGATTATGCTAGTAGTTGGCTAATCAGATCCTCACTCCAGTG ■ 

GTTTGCTCTGTGACGTTAGGATACTCCCATGGGATAGAAGTTACGTATAGGGAATGTCAGATATTCTTCA 

TTGTGCTGACTTGCTTTCGCTTACAGTTG^ 

CCTACTTCCCACGTCATCATGATTTCTTTTGAGGGAGAACTGJ^ 

AGCACCCGTCTCTGCAGAGGTTAGTGGCTCATACTTCCTCCCAGGAGCTGAGGTTATCGACTCTCACTGT 

TGCCTACAGAGCACAGATCCTGAACTAAATGAAACATTTACTTGGAATAATGCTAATTCTGTACATATTT 

TATTCCCTAGTCCCCACTTCCCTGTTTAAAAACAAAATCTACTTAGAAAAAAATCCCTGTGAATCAGTTG 

TCTAATGAATTTAGCAAGTTAAATGCCAGATTGACATTTTGCTTTATAGTTTATACAAGCATGT 

TTTTTTCTCGCAGAGAACC^TGGAGTCTGGCAGTACCGCCGCC^GTGAGGAGGCACG(^GCCTTCGAG^ 

TGTGAGCTCTACGTCCAGAAGCATAAC^TTCAAGCGCTGCTCAAAGATTCTATTGTG 

CTCGACCTGAGAGACCCATGGCATTCCTCAGGGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACA 

GATTCAGAATCTGCAGAAAGCAGGCACTCGTAC^ 

AACCCAGTGGTTAAAGGTAGGAGGCGACGAGGTGCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGG 
CATCCTATGTTAGAAAGGTTATACCAAAAGATTACAAGACAATGGCCGCTTTAGCCAAAGCGAT 
• GAATGTGCTGTTTTCACATCTTGATGATAATGAGAGAAGTGATATTTTTGATGCCATGTTTTCGGTCTCC 
TTTATCGCAGGAGAGACTGTGATTCAGCAAGGTGM^ 
AGACGGATGTCTATGTTAACAATGAATGGGCAACC^^ 
TTTGATTTATGGAACACCGAGAGCAGCCACTGTCAAAGCAAA^ 

CGAGACAGCTATAGAAGAATCCTCATGGGAAGCACACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTA 

GTAAAGTCTCTATTTTAGAGTCTCTGGACAAGTGGGAACGTCTTACGGTAGCTGATGCATTGGAACCAGT 

GCAGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGATGAGTTCTTCATTATTTTAGAG 

GGGTCAG CTGCTGTGCTACAACGTCGGT CAGAAAATGAAGAGTTTGTTGAAGTGGGAAGATTGGGGCCTT 

CTGATTATTTTGGTGAAATTGCACTACTGATGAAT 

CTTGAAGTGCGTTAAGCTGGACCGACCT^ 

CGAAACATCCAGCAGTACAACAGTTTTGTGTC^CT^ 

CTCCTCTCCCCAATCCATGCTTCACTCATGCAAACTGOTTTATTTTCC 

ACTGGCATCGCAGCTTCCTGTCTGTTTATATATTGAAAGTT 

CATTAACTAAATGCTCATACACAGTTAAATAAATAGAAAGAGTTCTATGG 

Human PRKAR1A mRNA sequence - varS (public gi: 1658305) (SEQ ID NO: 118) 

AGAGGCGTCAAGGGAGGCCGGAGGGAGAGTGGGGTGGACAGAGGAGCGGAGGGACGAGAGGGAAGCGCAC 
GATAGCTGCGCGGAGAGAGAGCGAAGAGCAGGAGGAGGAACAAAGGCGACCCAAGACACCCAGAGAGGGA 
CAGAGAACCATGGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCT 
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3TCCAGAAGCATAACATTCAAGCGCTGCT _ _ . _ 

SACCCATGGCATTCCTCAGGGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAARACAGA'I 
GCAGAAAGCAGGCACTCGTACAGACTCAAGGGAGGATGAGATTTCTCCTCCTCCACCCAA 




Human PRKAR1A protein sequence - var2 (public gi: 1658306) (SEQ ID NO: 265) 

MESGSTAASEEARSLRECELYVQKHN'IQALiLKDS IVQIiCTARPERPMAF.IiREYFERIjEKEEAKQ IQNLQK 
AGTRTDSREDEISPPPP 

H uman PRKAR1 A pray sequence - var 1 (SEQ ID NO: 1 19) 

GCCGCCTGGTNTACCCATA^ 

CCCAAGCAGTGGCTATCAACGCAGAGTGGTAGCGGGGCATGGGAGCAAAGC^ 
CNCCGCCGCCTCNCACCCGCAGCCTCGCGCCCGCCGCCGCCCGTCCCCAGN^ 
TACCGTTTCCAGTGAGGAGGCACNCAGCCTTCGAGAATGTGAGOTCTNNGTCCAGAAGCATN 

TGCGCTNCTCAAAGATTCTNTTGTGCANTTGTGC^ 

GGAATACTTGGCGNACGNNGNOT^^ 

TNACAAACTTTTTGGACNATOAN^ 



NCCC 

Human PRKAR1A pray sequence - var2 (SEQ ID NO: 120) 

GAGCGCCGCCATGGNANT ACCCATACGACGTACCAGNATTACGCTCATATGG ^^^GG AGG CC^^TGAAT 
TCCACCCAAGCAGTGGTATCAACGCAGAGTGGTAGCGGGGCTGGGAGCAAAGCGCTGAGGGAGCTCGGTA 
CGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATGGAGTCT^^G 
TACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAGCATAACATTCAA 

GCGCTGCTCAAAGATTCTATTGTGCAGTTGTGCACT^ 
AATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACAGAOT 

AGACTCAAGGGAGGATGAGATTTCTCCTCCTCCACC(^ACCCAGTGGTTAAAGGTAGGAGGCGACGAGGT 

GCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGGCATCCTATGTTAGAAAGGTTATACCAAAAGATT 

ACAAGACGATGGC CGCTTTAG CCAAAGCC ATTGAAAAGAATGTGCTGTTTTCACATCTTGATG ATAATGA 

GAGAAGTGATATTTTTGATGCCATGTTTTCGGTCTCCTTTATCGCAGGAGAGACTGTGATTCANCAAGGT 

GATGAAGGGGATAACTTCTATGTGATTGATCAAGGANAGACNGATGTCTATGTTAACAATGAATGGGCNA 

CCANTGTTGGGGAAGGAGGGAGCTTTGGAAAACTTGCTTTGATTNANGGAANCC 

AAACCAAAACAAA 

Human PRKAR1 A pray sequence - var3 (SEQ ID NO: 121) 

CGACGCCGCCTGGTATACCCATACGACGTACCAGTATTACGCTCATATGGCCATGGCAGGCCAGTGAATT 

CCACCCAAGCAGGTGCGATATGCATACGCGAGNAGTGAGTAACGGCGGCTGGGTAGCGAAGTCGCTGAGG 

GAGCTCGGTACNCCGCCAGCGCTCGCACCCGCANCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCAT 

GGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAG 

CATAACATTCAAGCGCTGCTCAAAGATTCTATTGTGCAGTTGTGCACTGCTCGACCTGAGAGACCCATGG 

CATTCCTCAGGGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACAGATTCAGAATCTGCAGAAAGC 

AGGCACTCGTAC^GACTCAAGGGAGGATGAGATTTCTCCTCCTCCACCCAACCCAGTGGTTAAAGGTAGG 

AGGCGACGAGGTGCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGGCATCCTATGTTAGAAAGGTAG 

TTTTTGATATTTGAATATCGGGGGGGATGOTTTNGGGACCCACTTGGTGGT 

ANTGATTCTTAAATCGAAAACNGGGNGGAACTTCATC^^ 

TTTTTNTAATACC^CTTNNCAANGAAAN^ 

GTTNNNNTTTCQTC'CNNNNT^ 

Human PRKAR1A pray sequence - var4 (SEQ ID NO: 122) 

CGTANCNNCGCGNGACTCGGTGACTGANGCCATGATCGCACATTACACACTATNTACCGTCTGACATCAT 
GGNTCAGTGTGCAGGGCCATGTTGANNTCTCCNC^CCATAN^ 

AGAGANATATTTTTANTACTNACTCACTATAGGGCGAGCGCCGCCATGGAGTACCCATACGACGTNCC^ 

ATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCACCCAAGCAGTGGTATCAACG 

GGCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCC 
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GCCCGTCCCCAGAGAACCATGGAGTCTGGCNGTACCGCCNNTiVNTGN 
GTGAGCTCTACGTCCAGAAGCATAACATNNGNGCGCT^ 

TCGACCTGAGAGACCCATGGCATTCCTCAGGGAATTACTTTGAGAGGTTGGANNAGGAGGAGGCNAACCA 

NATTCANAATCTGOTGAAAGCANNAOT 
NCCNANTGGTTAAGGGTNGGAGGCNACAAGGNCTNTTNN 
ATTCCTATGTTAAAANGGGTNTTTCCNNTANNN^ 
TCNAAAAAAANGNGCNNTTTCCHSIA^^ 

TNNCGTTNTCTTTTTCNGGNGGAAACNTTNATTAANNCCG 

Unigene Name : PRKARIA Unigene ID : Hs . 183037 Clone ID : 3GD_188 
Human PRKARIA mRNA sequence - varl (public gi: 23273779) (SEQ ED NO: 396) 

GGTGGAGCTGTCGCCTAGCCGCTATCGCAGAGTGGAGCGGGGCTGGGAGCAAAGCGCTGAGGGAGCTCGG 
TACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATGGAGTCTGGC 
AGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAGCATAACATTC 
AAGCGCTGCTCAAAGATTCTATTGTGCAGTTGTGCACTGCTCGACCTGAGAGACCCATGGCATTCCTCAG 
GGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAAGAGATTCAGAAT 

ACAGACTCAAGGGAGGATGAGATTTCTCCTCCTCCACCCAACCCAGTGGTTAAAGGTAGGAGGCGACGAG 
GTGCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGGCATCCTATGTTAGAAAGGTTATACCAAAAGA 
TTACAAGACAATGGCCGCTTTAGCCAAAGCCATTGAAAAGAATGTGCTGTTTTCACATCTTGATGATAAT 
GAGAGAAGTGATATTTTTGATGCCATGTTTTCGGTCTCCTTTATCGCAGGAGAGACTGTGATTCAGCAAG 
GTGATGAAGGGGATAACTTCTATGTGATTGATCAAGGAGAGACGGATGTCTATGTTAACAATGAATGGGC 
AACCAGTGTTGGGGAAGGAGGGAGCTTTGGAGAACTTGCTTTGAT 

GTCAAAGCAAAGACAAATGTGAAATTGTGGGGCATCGACCGAGACAGCTATAGAAGAATCCTCATGGGAA 
GCACACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTAGTAAAGTCTCTATTTTAGAGTCTCTGGACAA 
feTGGGAACGTCTTACGGTAGCTGATGCATTGGAACCAGTGCAGTTTGAAGATGGGCAGAAGATTGTGGTG 
CAGGGAGAACCAGGGGATGAGTTCTTCATTATTTTAGAGGGGTCAGCTGCTGTGCTACAACGTCGGTCAG 
AAAATGAAGAGTTTGTTGAAGTGGGAAGATTGGGGCCTTCTGATTATTTTGGTGAAATTGCACTACTGAT 
GAATCGTCCTCGTGCTGCCACAGTTGTTGCTCGTGGCCCCTTGAAGTGCGTTAAGCTGGACCGACCTAGA 
TTTCAACGTGTTCTTGGCCCATGCTCAGACATCCTCAAACGAAACATCCAGCAGTACAACAGTTTTGTGT 
CACTGTCTGTCTGAAATCTGCCTCCTGTGCCTCCCTTTTCTCCTCTCCCCAATCCATGCTTCACTCATGC 
AAACTGCTTTATTTTCCCTACT 

ATTGAAAGTTGCTTTTATTGC^CCATTTTCAATTTGGAGCATTAACTAAATGCTCATACACAGTTA 

AATAGAAAGAGTTCTATGGAGACTTTGCTGTTACTGCTTCTCTTTGTGCAGTGTTAGTATTCACCCTGGG 

CAGTGAGTGCCATGCTTTTTGGTGAGGGCAGATCCCAGCACCTATTGAATTACCATAGAGTAATGATGTA 

. TGCTGTATTTCTTTGTAGAATAAATGGTTTCTCATTAAACTCTAAAGATTAGGGAAAATGGATATAGAAA 
ATCTTAGTATAGTAGAAAGACATCTGCCTGTAATTAAACTAGTTTAAGGGTGGAAAAATGCCCATTTTTG 
CTAATTATCAATGGGATATGATTGGTTCAGTTT^ 

CCTGGTTTTATTTATATCTTGTTATTAATGTTTCTTCTCCAATTCTGAAATACTTTTGAGTATGGCTATC 

TATACCTGCCTTTTAAGTTTGAAACTAACTCATAGATTGCAAATATTGGTTAGTATTTAACTACATCTGC 

CTCGGCTCACAAATTCCGATTAGACCTTTATCCAGCTAGTGCCAAATAATTGATCAGATGCTGAATTGAG 

AATAAGAATTTGAGGTCTACATTCTTGGTTGTTAATTTAGAGCGTTTGGTTAAAGTATGTCCTTCAGCTG 

ACTCCAGTATAATCTCCTCTGCTCATTAAACTGATTCCAGGAGATTGGATTTGCTGTGACTAGATACAGA 

TGGAGC7VAATGTCCTAACAGAGAAATAGAGGTGATGCTGCTAAAGGGAGAAATGCCAGGCGGACAAAGTT 

CAGTGTCGGGAATTTTCCCCGTGACATTCACTGGGGCATGAGATTTTGGAAGAAGTTTTTTACTTTG 

TAGTCTTTTTTTCCTTCCTTTTTATTCAGCTAGAATTTCTGGTGGGTTGATGGTAGGGTATAATGTGTCT 

GTGTTGCTTCAAATTGGTCTGAAAGGCTATCCTGCGGAAAGTCCTGCTTTCCTATCTAGCATTTATTTCT 

CTGGCAAACTTTTCTTTCTTTTCTTTTTTAAAGTAAACTTGTGTATTGAGTCTT 

TTTCCAGCCTTATGTGTTACATTATTCC^ 

ATTTCACAGTTCTGTAATGTAGGCACTTTTATTTTCATTGTGATTTATATATAAGGTAATGTAGGGTTAT 

ATTTGGGAGTGACTGCAAGCATTTTTCGATCTC 

TGCCCTTTCCCAGGTAATTTAAATTGGTCATC 

TTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTAT^ 

TTGTTACAAATTTTTTAATTT^ 

AGCTACCTTTTTATATTTAAAAAATTAAAAATGAAAATTACGTTCTTACAA^ 
CTTTGTTTAAATGCGAAAATGTACTTAAATGAGTTACOT 
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gagatttactgctctagaaagtatagatggcg; 

ATTATACCATGTGTGCTAATATACTTTTTTr 
AGATGAAATAATTTAAAAAAAAAAAAAAAA 

TGAGCTCTACGTCCAGAAGC^TAAC^TTC^ 

rrarrTC3AGAGACCC^TGGCATTCCTCAGGGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACAeA 

??cagSSg^g^^ 

CCCAGTGGTTAAAGG^AGGAGGCGACGAGGTGCTATCUVGCGCTGAGGTCTAC 

aSScSSa^ 

AGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGATGAGTTCTTCATTA 

rTCAGCTGCTGTGCTACAACGTCGGTGAGAAAATGAAGAGTTTGTTGAAGTGGGAAGATTGGGGCCTTUl 

^ATTATTOTGGTGAAATTGCACTACTGATGAATCGTCCTCGTGCTGCCACAGTTGTTGCTCGT 

JggcaSg^c^^ 
SaSJaaSgS^^ 

CnTGTGCAGTGTTAGTATTCACCCTGGGCAGTGAGTGCCATGCTTTTTGGTGAGGGCAGATC^ 

StcSSaccatagagtaatgatgtaacagtgcaagat^^ 
SSg^gta^ 

CTAi^GATTAGGGAAATG 

S^SSgga^ 

arAOTTGTOG^TGCG^GCTAATCTGCCTGGTTTATTTATATCTTGTTATTAATGTTTCTTCTCCAATT 

^?^SStcag^tggctatctatacctgccttttaagtttc 

SaaSS^C^ 
TTGCTTAAAGTATGTC^ 

TGGAOTTGCTGTGACTAGATACAGATGGAGCAAATGTCCTAACAGAGAAATAGAGGTGATGCTGCTAAAG 
TTGGAAGAAG^TTTTACTTTGGTTTAGTCTTTTTTTCCTCCTTTTTATTCAGCTAGAATTTCTGGT 

Sgatcg^gggS^^^ 

gSc^a^atcccttc^ 
SagaSgIaa^ 

gSctca^gSctttgtgtttagctacctttttatatttaaaaaatta^ 
tacaagcttaaagcttgatttgatct 

Unigene Name: PTPN12 Unigene ID: Hs.62 
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Human PTPN1 2 jmRNA sequence - var 1 (public gi: 2 92 4 0 8) (SEQ ID NO: 123) 

AGCGACCGCAGCCGGGGGGACGCGGGAGQATGGAGCAAGTGGAGATCCTGAGGAAATTCATCCAGAGGGT 
CCAGGCCATGAAGAGTCCTGACCACAATGGGGAGGACAACTTCGCCCGGGACTTCATGCGGTTAAGAAGA 

TTGTCTACCAAATATAGAACAGAAAAGATATATCC^ 
, AGAACAGATACAAGGAI^TACTGCCATrrGATC^CAGCCGAGTTAAATTGACATTAAAGACTCCTTCAC^ 

" AGATTCAGACTATATCAATGCAAATTTTAT^ 

"JGGACCTTTAGGAAATACAGTAATAGATTTTTGGAGGATGATATGGGAGTATAATGTTGTGA 

^ TCGCCTGCCGAGAATTTGAGATGGGAAGGAAAAAATGTGAGCGCT 

CATAACGTTTGCACCATTTAAAATTTCTTGTGAGGATGAACAAGCAAGAACAG 

CTCTTACTTGAATTTCAAAATGAATCTCGTAGGCTGTATCZAGTTTGATTATGTGAACTGGCCAGACCATG 
ATGTTCCTTCATCATTTGATTOTATTCTGGACATGATAAGCTTAATGAGGAAATATCAAGAACATGAAGA 
TGTTCCTATTTGTATTCATTGCAGTGCAGGCTGTGGAAGAACAGGTGCCATTTGTGCCATAGATTATACG 

TGGAATTTACTAAAAGCTGGGAAAATACCAGAGGAATTT^ 

CACAAAGGCATTCTGCAGTACAAACAAAGGAGCAATATGAACT^ 

TGAAAAACAGCTACAACTATATGAAATTCATGGAGCTCAG 

ACTGAAAACATGATCAGCTCCATAGAGC CTG AAAAACAAGATT CTCCT C CTC CAAAACCACCAAGGAC CC 
GCAGTTGCCTTGTTGAAGGGGATGCTAAAGAAGAAA^ 

GATCTTGACACCTTCTCCCCCTTCAGC1TTTCCAACAGTCACTACTGTGTGGCAGGACAATGATA 
CATCCAAAGCCAGTGTTGCATATGGTTTGATCAGAACAACATTCAGCAGACC^ 

AATCAACAGAACTTCCAGGGAAAAATGAATCAAC^TTGAAC^GATAGATAAAAAATTGGAACGAAATTT 
AAGTTTTGAGATTAAGAAGGTCCCTCTCGAAGAGGGACCAAAAAGTTTTGATGGGAACACACTTTTGAAT 
AGGGGACATGCAATTAAAATTAAATCTGCTTCACCTTGTATAGCTGATAAAATCTCTAAGCCACAGGAAT 

TAAGTTCAGATCTAAATGTCGGTGATACTTCCCAGAATTCTTC 

CAAAGTTTCAGTTACTCCACCAGAAGAATCCCAGAATTCAGACACACCTCCAAGGCCAGACCGCTTGCCT 
CTTGATGAGAAAGGACATGTAACGTGGTCATTTCATGGACCTGAAAATGCCATACCCATACCTGATTTAT 
CTGAAGGCAATTCCTCAGATATCAACTATCAAACTAGGAAAACTGT 

ACAAGTTGAAACACCTGATCTTGTGGATCATGATAACACTTCACCACTCTTCAGAACACCCCTCAGTTTT 
ACTAATCCACTTCACTCTGATGACTCAGACTCAGATGAAAGAAACTCTC 

AAACTAATATTTCAACAGCAAGTGCCACAGTTTCTGCTGCCACTAGTACTGAAAGCATTTCTACTAGGAA 
AGTATTGCCAATGTCCATTGCTAGACATAATATAGCAGGAACAAC^CATTCAGGTGCTGAAAAAGATGTT 
GATGTTAGTGAAGATTCACCTCCTCCCCTACCTGAAAG 

ATAATACACCTGTAAGATCGGAATGGAGTGAACTTCAAAGTCAGGAACGATCTGAACAAAAA 

AGGCTTGATAACCTCTGAAAATGAGAAATGTGATCATCCAGCGGGAGGTATTCA 

GAATGTCCACCTACTTTCAGTGACAAGAGAGAACAAATATC^ 

GTTTTGGTAATCGATGTGGAAAACCCAAAGGACCAAGAGATCCACCTTCAGAATGGACATGATTCAGGGA 
GCTAGAAGACACTTTAAGTTATACTGGAAAATTCAGGTGCCACTGAAAGCCAGATTTATAGTATTCCATC 
TTTAATATGTGGGACTAACAGCAGTGTAGATTGTTACCTTAATATTTTTTGCTGGGACCATCTACCTGCC 
TTATACTACACTTAGGAAAAAGTATTACATATGGTTTATTTTGAAACTTCAAGTATTATTGCCTTAATGT 
CTCTTAACCCTGTTACACGCTGCTTGTAGACT^TGTTAATATAGTAATACCTTTATGATATATTGAGXTTA 

TTTGATAAAGTTGCAATTGAAATATTATTAACAGAAGATGTAA 
TGTACTTTATTTGTAAATTATTTGC 

TCATGCAGCCAGCTTTGCAGGTTATCAGAGATCA^GATTGTAATAATAATTTTGTAAATTGTAAGCAAA 

AAGTTATTTTTATATTATATACAGTCTAATTGTTCATCCTAATTGTTCCTGTTTTCATCTAGTCAGAGAT 

TCAGTAAGTGCCTTGGAAC^^TATTGAATTCTCTTAGCTTGTGTGTGTTTCTTTAATATTTGAAC 

TGGGATTAGAAGACTATCAAAATACATGTATGTTTCAGATATTTC 

TTTTACAGTG 

Human PTPN12 mRNA sequence - var2 (public gi: 29476876) (SEQ ID NO: 124) 

GGGGAGAGGCGGCTGCGGCTGCGGCTGCGGCTGCTGGCGGGGGGTGGGGGGGAGGAGGAACCGGGAAGGG 

GGGGCAGGGCGAGCGGAGAGCTAGCTGTGTTCCTGAGGCGCAGCCGCCGCCTAGGGCGGTGGGGAGGAGG 

AGGGAGCCGCGGGGCTTGGCGGGGTCGGGAGGGAGGGACGTGCTGGGGGAACGAGCTGGGGAAGACGGAG 

CGGGCTCTGTGCCGGGCGGGCGGGCGGCGGGGGGGCCAGCGACCGCAGCCGGGGGGACGCGGGAGGATGG 

AGCAAGTGGAGATCCTGAGGAAATTCATCCAGAGGGTCCAGGCCATGAAGAGTCCTGACCACAATGGGGA 

GGACAACTTCGCCCGGGACTTC^TGCGGTTAAGAAGATTG 

CCCACAGCCACTGGAGAAAAAGAAGAAAATGTTAAT^AAGAACAGAXAC^^ 

ACAGCCGAGTTAAATTGACATTAAAGACTCCTTCACAAGATT CAGACTAT AT CAATGCAAATTTTAT AAA 
GGGCGTCTATGGGCCAAAAGCATATGTAGCAACTCAAGGAC^ 

AGGATGATATGGGAGTATAATGTTGTGATCATTGTAATGGCCTGCCGAGAATTTGAGATGGGAAGGAAAA 
AATGTGAGCGCTATTGGCCTTTGTATGGAGAAGACCCCATAACGTTTGCACCATTTAAAATTTCTTGTGA 
GGATGAACAAGCAAGAACAGACTACTTCATCAGGACACTC 

CTGTATCAGTTTCATTATGTGAACTGGCCAGACCATGATGTTCCTTCATCATTTGATTCTATTCTGGACA 
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TGATAAGCTTAATGAGGAAATATCAAGAACATGAAGATGTTCCTATTTGTATTCATTGCAGTGCAGGCTG 
TGGAAGAACAGGTGCCATTTGTGCCATAGATTATACGTGGAATTTACTAAAAGCTGGGAAAATACCAGAG 

GAATTTAATGTATTTAATTTAATAGAAGAAATGAGAACA 
2UVTATGAACTTGTTCATAGAGCTATTGCC 

AGCTCAGAAAATTGCTGATGGAGTGAATGAAATTAACACTGAAAAC^TGATOVGCTCCATAGAGCCTGAA 

AAACAAGATTCTGCTCCTCCAAAAC(^CCAAGGACCCGCAGTTGCCTTGTTGAAGGGGATGCTAAAGAAG 

AAATACTGCAGCCACCGGAACCTCATCCAGTGCCACCCATCTTGACACCTTCT 

AAGAGTCACTACTGTGTGGCAGGACAATGATAGATACC^ 

GAACAACATTCAGCAGACCTCAACAGAAACTATAGT^ 

CAATTGAACAGATAGATAAAAAATTGGAACGAAATTTAAGTTTTGAGATTAAGAAGGTCCCTCT 
GGGACCAAAAAGTraTGATGGGAACACACTTTTGAATAGGG GAC ATG CAATTAAAATTAAAT CTGCTTCA 
CCTTGTATAGCTGATAAAATCTCTAAGCCAC^^ 

AGAATTCTTGTGTGGACTGCAGTGTAAC^CAATCAAACAAAGTTTCAGTTACTCCACCAGAAGAATCC^ 
GAATTCAGACACACCTCCAAGGCCAGACCGCTTGCCTCTT& 

CATGGACCTGAAAATGCCATACCCATACCTGATTTATCTGAAGGCAATTCCTCAGATATCAACTATCAAA 

CTAGGAAAACTGTGAGTTTAACACCAAGTCCTACAACACAAG^ 

TAACACTTGACCACTCTTCAGAACACCCCT 

GATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATAAAACTAATATTTCAACAGCAAGTGCCACAGTTT 
CTGCTGCCACTAGTACTGAAAGCATTTCTACTAGGAAAGTATTGCCAATGTCCATTGCTAGACATAATAT 
AGCAGGAACAACACATTC AGGTGCTGAAAAAGATGTTGATGTTAGTGAAGATTCACCTCCT C CCCTACCT 

TTCAAAGTCAGGAACGATCTGAACAT^AAAAAGTCTGAAGGCTTGATAACCTCTGAAAATGAGAAATGTGA 
TCATCCAGCGGGAGGTATTCACTATGAAATGTGCATAGAATGTCCACCTACTTTCAGTGACAAGAGAGAA 
CAAATATCAGAAAATCCAACAGAAGCCACAGATATTGGTTTTGGTAATCGATGTGG 
CAAGAGATCCACCTTCAGAATGGACATGATTCAGGGAGCTAG^ 

CAGGTGCCACTGAAAGCCAGATTTATAGTATTCCATCTTTAATATGTGGGACTAACAGCA 

TTACCTTAATATTTTTTGCTGGGACCATCTACCTGCCTTATACTACACTTAGGAAAAAGTATTACATATG 

GTTTATTTTGAAACTTCAAGTATTATTGCCTTAATGTCTCTTAACCCTGTTACACGCTGCTTGTAGACAT 

GTTAATATAGTAATACCTTTATGATATATTGAGTTTAAGGACTACTCTTTTTCTGTTTTATCATGTATGC 

ATTATTTTGTATATGTACAGGGCAAGTAGGTATATAATTTGATAAAGTTGCAATTGAAAT^ 

GAAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTGTGTACTTTATTTGTAAATTATTTGCCCTGGAGT 

TTTAGAAAATAGTTTCTGAATTTTAAACTTGCT^ 

AAAGATTGTAATAATAATTTTGTAAATTGTAAGCAAAAAGTTATTTTTATATTATATACAGTCTAATTGT 
TCATCCTAATTGTTCCTGTTTTCATCTAGTCAGAGATTCAGTAAGTGCCTTGGAACAATATTGAATTCTC 
TTAGCTTGTGTGTGTTTCTTTAATATTTGAACTCAAGTGGGATTAGAAGACTATCAAAATACATGTATGT 
TTCAGGATATTTGACCTGTCATTAAAAAAAACAAACAGTTTTACAGTGCCAAAAAAAAA 

Human PTPN12 mRNA sequence - var3 (public gi: 18375651) (SEQ ID NO: 125) 

AGCGACCGC^GCCGGGGGGACGTOGGAGGATGGAGC^GTGGAGATCCTGAGGAAATTCATCCAGAGGGT 
CCAGGCCATGAAGAGTCCTGACCACAATGGGGAGGACAACTTCGCCCGGGACTTCATGCGGTTAAGAAGA 
TTGTCTACGAAATATAGAACAGAAAAGATATATCCC^^ 
AGAACAGATACAAGGACATACTGCCATTTGATCACAGCCGAGT^ 

AGATTC^GACTATATCAATGCAAATTTTATAAAGGGCGTCTATGGGCCAAAAGCATATGTAGCAACTCAA 
GGACCTTTAGGAAATACAGTAATAGATTTTTGGAGGATC 

TGGCCTGCCGAGAATTTGAGATGGGAAGGAAAAAATGTGAGCGCTATTGGCCTTTGTATGGAGAAGACCC 

CATAACGTTTGCACCATTTAAAATTTCTTGTGAGGATGAACAAGCAA 

OTCTTACTTGAATTTCAAAATGAATCTCGTAGGCTGTATCAGTTTCATTATGTGAAOT 

ATGTTCCTTCATCATTTGATTCTATTCTGGAC^TGATAAGCTTAATGAGGAAATATCAAGA 

TGTTCCTATTTGTATTCATTGCAGTGCAGGCTGTGGAAGAACAGGTGCCATTTGTGCCATAGATTATACG 

TGGAATTTACTAAAAGCTGGGAAAATACCAGAGGAATTTAATGTATTTAATTTAATACAAGAAATGAGAA 

CACAAAGGCATTCTGCAGTACAAACAAAGGAGCAATATGAACTTGTTCATAGAGOT 

TGAAAAACAGCTACAACTATATGAAATTCATGGAGCT 

ACTGAAAACATGATCAGCTCCATAGAGCCTGAAAAACAAGATTCTCCTCCTCCAAAACCACCAAGGACCC 

GOVGTTGCCTTGTTGAAGGGGATGCTAAAGAAGAAATACTGCAGCCACCGGAACCTCATCCAGTO 

C^TCTTGAGACCTTCTCCCCCTTCAGCTTTTCC^ 

CATCCAAAGCCAGTGTTGCATATGGTTTCATCAGAACAACATTCAGCAGACCT 
AATCAAC^GAACTTCCAGGGAAAAATGAATGAACAATTGAACAGATAGATAAA 
AAGTTTTGAGATTAAGAAGGTCCCTCTCCAAGAGGGACCAAAAAGTO 

AGGGGACATGCAATTAAAATTAAATCTGCTTCACOTTGTATAGCTGATAAAATOTCTAAGCC^CAGGAAT 
TAAGTTCAGATCTAAATGTCGGTGATACTTCCCAGA^ 

CAAAGTTTCAGTTACTCCACCAGAAGAATCCGAGAATTC^GACACACCTCCAAG^ 

CTTGATGAGAAAGGACATGTAACGTGGTCATTTCATGGACCTGAAAATGCCATACCCATACCTGATTTAT 
CTGAAGGCAATTCCTCAGATATCAACTATCAAACTAGGAAAACTGTGAGTTTAAGACCAAGTC 
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ACAAGTTGAAACACCTGATCTTGTGGATCATGATAACACTO 

ACTAATCCACTTCACTCTGATGACTCAGACTCAGATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATA 

AAACTAATATTTCAACAGCAAGTGCCACAGTTTCTGCTGCCACTAGTACTGAAAGCAT 

AGTATTGCCAATGTCCATTGCTAGACATAATATAGCAGGAACAACACATTCAGGTGCTGAAAAAG 

GATGTTAGTGAAGATTCACCTCCTCCCCTACCTGAAAGAACT 

ATAATACACCTGTAAGATCGGAATGGAGTGAACTTCAAAGTCAGGAACGATCTGAACAAAAAAAGTCTGA 
AGGCTTGATAACCTCTGAAAATGAGAAATGTGAT^ 

GAATGT CCACCTACTTTCAGTGACAAGAGAGAACAAATATCAGAAAAT CCAACAGAAG CCACAGATATTG 
GTTTTGGTAATCGATGTGGAAAACCCAAAGGACCAAGAGATCCACCTTCAGAATGGACATGATTCAGGGA 
GCTAGAAGACACTTTAAGTTATACTGGAAAATTCAGGTGCCACTGAAAGCCAGATTTATAGTATTCCATC 
TTTAATATGTGGGACTAACAGCAGTGTAGATTGTTACCTTAATATTTTTTGCTGGGACCATCTACCTGCC 

TTATACTACAGTTAGGAAAAAGTA 

CTCTTAACCCTGTTACACGCTGCTTGTAGACATGTTAATATAGTAATACCTTTATGATATATTGAGTTTA 
AGGACTACTCTTTTTCTGTTTTATCATGTATGCATTATC 

TTTGATAAAGTTGCAATTGAAATATTATTAACAGAAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTG 
TGTACTTTATTTGTAAATTATTTGCCCTG 

TCATGCAGCC^GCTTTGCAGGTTATC^GAGATCAAAGATTGTAATAATAATTTTGTAAATTGTAAGCAAA 
AAGTTATTTTTATATTATATACAGTCTAATTGTTCATCCTAATTGTTCCTGT TTT CATCTAGTCAGAGAT 
TCAGTAAGTGCCTTGGAACAATATTGAATTCTCTTAGCTTGTGTGTGTTTCTTTAATATTTGAACTCAAG 
TGGGATTAGAAGACTATCAAAATACATGTATGTTTCAGGATATTTGACCTGTCATTAAAAAAAACi^ 

GTTTTACAGTG 

s 

Human PTPN12 mRNA sequence - var4 (public gi: 545651) (SEQ ID NO: 126) 

GTTAAAAGGAACAGATACAAGGACATACTGCCATTTGATCACAGCCGAGTTAAATTC3ACATTAAAGACTC 

OTTCACAAGATTCAGACTATATCAATGGAAATTTTATAAAGGGCGTCTATGGGCCAAAA^ 

AACTCAAGGACCTTTAGCAAATACAGTAATAGATTTTTGGAGGATGGTATGGGAGTATAATGTTGTGATC 

ATTGTAATGGCCTGCCGAGAATTTGA 

Human PTPN12 mRNA sequence - var5 (public gi: 19683 965) (SEQ ID NO: 127) 

GGGACTTCACCACTCTTCAGAACACCCCTCAGTTTTAGTAA 
ATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATAAAACTAATAT^^ 
TGCTGCCACTAGTACTGAAAGCATTTCTACTAGGAAAGTATTGCCAATGTCCATTGCTAG^ 
GCAGGAACAACACATTCAGGTGCrGAAAAAGATGTTGATGTTAGTGAAGAOT 

AAAGAACTCCTGAATCGTTTGTGTTAGCAAGTGAACATAATACACCTGTAAGATCGGAATGGAGTGAACT 
TGAAAGTCAGGAACGATCTGAACAAAAAAAGTCTGAAGGCT 

CATCCAGCGGGAGGTATTCACTATGAAATGTGCATAGAATGTCCACCTACTTTCAGTGACAAGAGAGAAC 
AAATATCAGAAAATCCAACAGAAGCCACAGATATTGGTTTTGGTAATCGATGTGGAAAACCCAAAGGACC 
AAGAGATCCACCTTCAGAATGGACATGATTCAGGGAGCTAGAAGACACTTTAAGTTATACTGGAAAA'raC 
AGGTGCCACTGAAAGCCAGATTTATAGTATTCCATCTTTAATATGTGGGACTAACAGCAGTGTAGATTGT 
TACCTTAATATTTTTTGCTGGGACCATCTACCTGCCOT 

TTTATTTTGAAACTTCAAGTATTATTGCCTTAATGTCTCTTAACCCTGTTACACGCTGCTTGTAGACATG 

TTAATATAGTAATACCTTTATGATATATTGAGTTTAAGGACTACTCTTTTTCTGTTTTATCA^ 

TTATTTTGTATATGTACAGGGCAAGTAGGTATATAATTTGATAAAGTTGCAATTGAAATATTATTAACAG 

AAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTGTGTACTTTATTTGTAAATTATTTGCCCTGGAGTT 

TTAGAAAATAGTTTCTGAATTTTAAACTTGCTGGATTGAT^ 

AAGATTGTAATAATAATTTTGTAAATTGTAAGCAAAAAGTTATTTTTATATTATATAC7VGTCT 

CATCCTAATTGTTCCTGTT1TCATCTAGTCAGAGATTCAGTAAGTGCCTTGGAACAATATTGAATTCTCT 

TAGCTTGTGTGTGTTTCTTTAATATTTGAACT 

TCAGGATATTTGACCTGTCATTAAAAAAAACA7VACAGTTTTACAATAAAAAAAAAAAAAAAA 
AAAAAA 

Human PTPN12 mRNA sequence - var6 (public gi: 220033) (SEQ ID NO: 128) 

GCCGGGGGGACGCGGGAGGATGGAGCAAGTGGAGATCCTGAGGAAATTCATCCAGAGGGTCCAGGCCATG 
AAGAGTCCTGACCACAATGGGGAGGACAACTTCGCCCGGGACTTCATGCGGTTAAGAAGATTGTCTACCA 
AATATAGAACAGAAAAGATATATCCCACAGCCACTGGAGAAAAAGAAGAAAATGTTAAAAAGAACAGATA 
CAAGGACATACTGCCATTTGATCACAGCCGAGTTAAAT 

TATATCAATGCAAATTTTATAAAGGGCGTCTATGGGCCAAAAGCATATGTAGC^ 

CAAATACAGTAATAGATTTTTGGAGGATGGTATGGGAGTATAATGTTGTGATCATTGTAATGGCC^ 

AGAATTTGAGATGGGAAGGAAAAAATGTGAGCGCTATTGGCCTTTGTATGGAGAAGACCCCATAACGTTT 

GCACCATTTAAAATTTCTTGTGAGGATGAACAAGCAAGAAC^^ 

AATTTCAAAATGAATCTCGTAGGCTGTATCAGTTT 

ATCATTTGATTCTATTCTGGACATGATAAGCTTAATGAGGAAATATCAAGAACATGAAGATGTTCCTATT 
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TGTATTCATTGCAGTGCAGGCTGTGGAAGAACAGGTGCCA.TT 
TAAAAGCTGGGAAAATACCAGAGGAATTTAATGTATTTAATT^^ 
TTCTGCAGTACJ^AACAAAGGAGCAATATGAACTTGTTCATAGAGCT 
CTAC^CTATATGAAATTC^TGGAGCTC 

TGGTCAGCTCCATAGAGCCTGAAAAAGAAGATTCTCCTCCTCCAAAACCACCAAGG 

TGTTGj^GGGGATGCTAAAGAAGAAATACTGCAGCCACCGGAACCTCATCCAGTGCCACCCATOT 

CCTTCTCCCCCTTCAGCTTTTCCAACAGTCACTACTGTO 

^ ^ GTGTT GCATATGGTTTCATCAGAACAACATTCAGCAGAC^ 

ACTTCCAGGGAAAAATGAATCAACAATTGAACAGATAGATAAAAAATTGGAACGAAATTTAAGTTTTGAG 

ATTAAGAAGGTCCCTCTCCAAGAGG^ 

CAATTAAAATTAAATCTGCTTCACCTTGTATAGCTQATAAAATCTCT 
TCTAAATGTCGGTGATACTTCCCAGAATTCTTGTGTGGACTGCAGTGTAAC^ 

GTTACTCCACCAGAAGAATCCCAGAATTCAGACACACCTCCAAGGCCAGACCGCTTGCCTCTTGATGAGA-^ 
AAGGACATGTAACGTGGTCATTTC^TGGACCTGAAAATGCCATACCCATACCTGATTTATCTGAAGGC^ 

TTCCTCAGATATCAACTATCAAACTAGGAAAACTGTGAGTTTAAGACCAAG^^ 
ACACCTGATCTTGTGGATCATGATAACACTTCACCACTCTTCAGAACACCCCTCAGTTTTACT 
TTCACTCTGATGACTCAGACTCAGATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATAAAACTAATAT 
TTCAACAGCAAGTGCGACAGTTTCTGCTGCCACTAGTACTGAAAGCATTTOT 

ATGTCCATTGCTAGACATAATATAGCAGGAACAACACATTCAGGTGCTGAAAAAGATGTTGATGTTAGTG 
AAGATTCACCTCCTCCCCTACCTGAAAGAACTCCTGAATCGTTTGTGTTAGCAAGTGAACATAATACACC 
TGTAAGATCGGAATGGAGTGAACTTCAAAGTCAGGAACGATCTGAACAAAAAAAGTCTGAAGGCTTGATA 
ACCTCTGAAAATGAGAAATGTGATCATCCAGCGGGAGGTATTCACTATGAAATGTGCATAGAATGTCCAC 
CTACTTTCAGTGACAAGAGAGAACAAATATCAGAAAATCCAACAGAAGCCAGAGATATTGGTT 
TCGATGTGGAAAACCCAAAGGACCAAGAGATCCACCTTC^GAATGGACATGATTCAGGGAGOTAGAAG 
ACTTTAAGTTATACTGGAAAATTCAGGTGCCAC^GAAAGCCAGATTTATAGTATTCC^TCTTTAATATGT 
GGGACTAACAGC^GTGTAGATTGTTACCTTAATATTTTTTGCTGGGACCATCTACCTGCCTTATACTACA 
CTTAGGAAAAAGTATTACATATGGTTTATTTTGAAACTTCAAGTATTATTGCCTTAATGTCTCTTAACCC 
* TGTTACACGCTGCTTGTAGACATGTTAATATAGTAATACCTTTATGATATATTGAGTTTAAGGACTACCC 
TTTTTCTGTTTTATCATGTATTC^TTATTTTGTATATGTAC^GGGCAAGTAGGTATATAATTTGATAAAG 
TTGCAATTGAAATATTATTAACAGAAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTGTGTACTTTAT 
TTGTAAATTATTTGCCCTGGAGTTTTAGAAAATAGTTTCTGAATTTO 

AGCTTTGCAGGTTATCAGAGATCAAAGATTGTAATAATAATTTTGTAAATTGTAAGCAAACATTCTGC 

Human PTPN12 protein sequence - varl (public gi: 220034) (SEQ ID NO: 266) 

MEQVEILRKFI QRVQAMKS PDHNGEDNFARDFMRLRRLSTKYRTEKI YPTATGEKEENVKKNRYKDI LPF 
DHSRVKXiTLKTPSQDSDY I NANFI KGVYGPKAYVATQGPIANTVI DFWRMVWE YNVVI I VMACREFEMGR 
KKCERYWPLYGEDP I TFAPFKI SCEDEQARTDYF I RTLLLE FQNESRRLYQFHYVNWPDHDVPS S FDS I L 
DMI SLMRKYQEHEDVPI CIHCSAGCGRTGAI CAIDYTWNLLKAGKI PEEFNVFNLIQEMRTQRHSAVQTK 
EQYELVHRAI AQLFE KQLQLYE IHGAQKI ADGVNE INTENMVSS I E PEKQDS P PPKP PRTRS CLVEGDAK 
EEILQPPEPHPVPPILTPSPPSAFPTVTTVWQDNDRYHPKPVLH^ 
STIEQIDKKLERNLSFEIKXVPI^EGPKSFDGNTLLNRGHAIM 

SQNSCVDCSVTQSNKVSVTPPEESQNSDTPPRPDRLPLDEKGHVTWSFHGPENAI PI PDLSEGNSSDINY 
QTRKTVSLTPSPTTQVETPDLVDHDNTSPLFRTPIiSFTNPLHSDDSDSDERNSDGAVTQNKTNISTASAT 
VSAATSTESISTRKVLPMSIARmiAGTTHSGAEKDVDVSEDSPPPLPERTPESFVIASEHOT 
ELQSQERSEQKKSEGLITSENEKCDHPAGGIHYEMCIECPPTFSDKREQISENPTEATDIGFGNRCGKPK 

GPRDPPSEWT 

Human PTPN12 protein sequence - var2 (public gi: 7689910) (SEQ ID NO: 267) 

VKRNRYKDILPFD5SRVKLTLKTPSQDSDYINANFIKGVYGP 
IVMACREF 

Human PTPN12 protein sequence - var3 (public gi: 292409) (SEQ ID NO: 268) 

MEQVE I LRKFIQRVQAMKS PDHNGEDNFARDFMRLRRLSTKYRTE KI YPTATGE KEENVKKNRYKDILPF 
DHSRVKLTLKTPSQDSDYINANFIKGVYGPKAYVATQGPLANTVIDFWRMIWEYNV^ I VMACREFEMGR 
KKCERYWPLYGEDPI TFAPFKI SCEDEQARTDYFI RTLLLE FQNES RRL YQFHYVNWPDHDVPSS FDS I L 
DMI S LMRKYQEHEDVP I C IHCSAGCGRTGAI CAIDYTWNLLKAGKI PEEFNVFNLI QEMRTQRHS AVQTK 
EQYELVHRAI AQLFE KQLQLYE IHGAQKIADGVNE INTENMI S S I E PEKQDSPPPKP PRTRS CLVEGDAK 
EEILQPPEPHPVPPILTPSPPSAFPTVTTWQDNDRYHPKPVLHMVSSEQHSADLNRNYSKSTEL 

STIEQIDKKLERNLSFEIKKVPLQEGPKSFDGNTLLNRGHAIKIK^ 

SQNS CVDCS VTQSNKVSVT PPEESQNSDTPPRPDRLPLDEKGHVTWS FHGPENAI P I PDLSEGNS SDINY 
QTRKTVSLTPS PTTQVET PDLVDHDNTS PLFRTPLS FTNPLHSDDSDSDERNSDGAVTQNKTNI STASAT 
VS AATSTES I STRKVLPMS I ARHNI AGTTHSGAEKDVDVSEDS PP PLPERTPES FVLAS EHNTPVRSEWS 
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ELQSQERSEQKKSBGfclTSENEKCDHPA^^ 
GPRDPPSEWT 

Human PTPN12 pray sequence- varl (SEQroNO:129) 

GTTTGGNATNCTACAGGGNATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATTCGATGAT 
GAAGATACCCCACCAAACCCAAAAAAAGAGATCTTTAATACGACTCACTATAGGGCGAGCGCCGCCATGG 

AGTACCCATACGACGTACCAGATTACGCTCATATGGCCATGG 

TATCAACGCAGAGTGGAATTATGGGCGGCGGCGGCGGCGTCTCCGACGGAGGAGGAGGGCGGGGAAGGAG 
GATGGAGCAAACAAAGTTTCAGTTACTCCACCAGAAGAATCCCAGAATTCAGACAC^C 
ACCGCTTGCCTCTTGATGAGAAAGGACATGTAACGTGGT 
ACCTGATTTATCTGAAGGCAATTCCTC^^ 

AGTCCTACAACACAAGTTGAANGCACCTGATCTTGTGGATCATGATAACGCTTCACCACTCTTCAGAACA 

CCCCTCANTTTTACTAATCCACTTCACTCTN^ 

TGACCCANAATAAAACTAATATTTCAACAGCAAGT^ 

TTCTACTAGGAAAGTATTGCCNATGTCCATTGCTAGACNTANTATANCAGGAACANACATTCAGGTGCTG 
AAAAAAANTTNATGTTI^TGAANATTNCCTNCTCCC3^NCCTNAANAACTCC 

Unigene Name : RALA Unigene ID : Hs.6906 Clone ID : 3GD_1106 
Human RAlAmRNA sequence- varl (public gi: 35845) (SEQ ID NO: 130) 

ATGGCTGCAAATAAGCCCAAGGGTCAGAATTCTTTGGCTTTACACAAAGTCATCATGGTGGGCAGTGGTG 

GCGTGGGCAAGTCAGCTCTGACTCTACAGTTCATGTACGATGAGTTTGTGGAGGACTATGAGCCTACCAA 

AGCAGACAGCTATCGGAAGAAGGTAGTGCTAGATGGGGAGGAAGTCCAGATCGATATCTTAGATACAGCT 

GGGCAGGAGGACTACGCTGCAATTAGAGACAACTACTTCCGAAGTGGGGAGGGGTTCCTCTGTGTTTTCT 

CTATTACAGAAATGGAATCOTTTGCAGCTACAGCTGACTTCAGGGAGCAGATTTTAAGAGTA7UVAGAAGA 

TGAGAATGTTCCATTTCTACTGGTTGGTAACAAATC^ 

(^GGCAAAA^CAGAGCTGAGCAGTGGAATGTTAACTACGTGGA 

TTGAC^GGTATTTTTTGATTTAATGAGAGAAATTCGAGCGAGAAAGATGGAAGACAGCAAAGAAAAGAA 
TGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCAGAGAAAGATGCTGCATTTTATAA 

Human RALA mRNA sequence - var2 (public gi: 24980846) (SEQ ID NO: 131) 

CCGCTCCCCAGAGGA^GCGTCGGAGTCCTCOTCCT 
CCC^GGCTCCCCCGCC^CCCGTCAGACTCCT^ 

GGACCGAGTACCCTCCGGCCGGAGCCACGCAGCCGCGGCTTCCGGAGCCCTCGGGGCGGCGGACTGGCTC 
GCGGTGCAGATTCTTCTTAATCCTTTGGTGAAAACTGAG^ 

(^GAATTCTTTGGCTTTACACAAAGTCATCATGGTGGGCAGTGGTGGCGTGGGCAAGTCAGCTCTGACTC 
TACTVGTTCATGTACGATGAGTTTGTGGAGG^CTATGAGCCTACCAAAGCAGACAGCTATCGGAAGAAGGT 
AGTGCTAGATGGGGAGGAAGTCCAGATCGATATCTTAGATACAGCTGGGCAGGAGGACTACGCTGCAATT 
AGAGACAACTACTTCCGAAGTGGGGAGGGGTTCCTCTGTGTTTTCTCTATTACAGAAATGGAATCCTTTG 
CAGCTACAGCTGACTTCAGGGAGCAGATTTTAAGAGTAAAAGAAGATGAGAATGTTCCATTTCTACTGGT 
TGGTAACAAATCAGATTTAGAAGATAAAAGACAGGTTTCTGTAGA 

TGGAATGTTAACTACGTGGAAACATCTGCTAAAACACGAGCTAATGTTGACAAGGTATTTTTTGATTTA^ 

TGAGAGAAATTCGAGCGAGAAAGATGGAAGACAGCAAAGAAAAGAATGGAAAAAAGAAGAGGAAAAGTT^ 

AGCCAAGAGAATCAGAGAAAGATGCTGCATTTTATAATCAAAGCCCAAACTCCTTTCTTATCTTGACCAT 

ACTAATAAATATAATTTATAAGC^TTGCCATTGAAGGCTTAATTGACTGAAATTACTTTAACATTTTGGA 

AATTGTTGTATATCZACTAAAAGCATGAATTGGAACTGCZAATGAAAGTCAAATTTACT^ 

AATATGGCTTCACCAAGAAGCAAAGTTCAACTTATTTCATAATTGCCTACATTTATCATGGTCCTGAATG 

TAGCGTGTAAGCTTGTGTTTCTTGGGGAGTCTTTCTTGAAATTGAAGAGGTGAAATGGGGGTGGGG 

GGAGGAAAGGTGACTTCCTCTGGTGTTTATTATAAAGCTTAAATTTTATATCATTTTAA 

CTTCTACTGCCTTGAAAAATGACAATTGTGAACATGATAGTTAAACTACCACTTTTTTTAACC^TTAT 

TGCAAAATTTAGAAGAAAAGTTATTGGCATGGTTGTTGCATATAGTTAAACTGAG 

AATCK3CTTTAATTACCTGGTGAGTAACTTAGAAAAGTGGTC 

TGCCTTAATTTAGAAACTGAAAAATATCTGGTTATATCAT^ 

CCGCTGCCCCATGTGTCCTGGTGAGAAAATATATGCCTGG 

GTAACTGTCCGCTAGAAGTCTGTCCAAATTTAAA^ 

CAGAGCTCTTTGATCGCTTTTAATAAACTGCAAGTTCATTTTAAATGAAGGGCCAGCATATATACTTGCA 

AGATAATTTTCAGCTGCAAGGATTCAGC^CCAGTTATGT 

GGTCCCTGGAAATCCCTTTCTGCTAGTGGTGAGGATGT^ 

TAGGAAGAAAATGTCAGTAGTGCTAATGCATTTTGCACTAGAACGCTTCGGGAAAATATTCATGCITGCC 
ATCTGTTCATTTCTAAATTTATATTCATAAAGTTACAGTTTGATACAGGAATTATTAGGAGTAATTCTTT 
TCTGTTTCTGTTTATAATGAAGAAGACTGTAGCTACATTTTCAGAAGTTAACATCAAGCCA 
GGTATAGTG CAG AAAACGTGGCACACACTGAC CACA.CATTAGGCTGTGTCACCATTGTGTGGTGTACCTG 
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CTGGAAGAATTCTAGCATGCT^^ 
TTTTTCCTCTTTGCAGTGGGGCTAGGAGAGTTO^ 
ATTTGAACAGGTCAAAGATGTGTTCAGGCATTCCAGGTAACAGGTGTO 
TTTTA^AACTCACTCTTTAGATATTTACATC 

AGATGTACATCTTTCATTTCGTATTTCTCATAGGCTATC 
""ACTGGCGAGCGGGCCCAGCAATCTCCATGTGTACTTATTACAGTCTTATTTAACCAGGGGTCCTAACCAC 

TAACATTGTGACTTTGCTTTGAGACCTTTCCTCTCCTO 
^^ATGCATCACGTGTCTTAGGCTGATGCC^CTACCCGATTTGT^ 
CAATAAACTTCCTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human RALA mRNA sequence - var3 (public gi: 3483427) (SEQ ID NO: 132) 

ATAATCAAAGCCCAAACTCCTTTCTTATCTTGACCATA 

AAGGCTTAATTGACTGAAATTACTTTAACATTTTGGAAATTGTTGTATATCACTAAAAGC^ 

ACTGC^TGAAAGTCAAATTTACTTTAAAAAGAAATTAATATGGCTTCAC CAAGAAGCAAAGTT CAACTT 

ATTTCATAATTGCCTACATTTATCATGGTCCTGAATGTAGCGTGTAAGCTTGTGTTTCTTGGGCA.GTCTT 

TCTTGAAATTG7VAGAGGTGAAATGGGGGTGGGGAGTGGGAGGAAAGGTGACTTCCTCTGGTGTTTATTAT 

AAAGCTTAAATTTTATATCATTTTAAAATGTCTTGGTCTTCTACTGCCTTGAAAAATGACAATTGTGAAC 

ATGATAGTTAAACTACCACTTTTTTTAACCATTATTATGCAAAAAAAAAA 

Human RALA mRNA sequence - var4 (public gi: 20147712) (SEQ ID NO: 133) 

ATGGTCGACTACCTAGCAAATAAGCCCAAGGGTQAGAATTCTTTGGCTTTACACAAAGTCATCATGGTGG 
GCAGTGGTGGCGTGGGCAAGTCAGCTCTGAOTCTACAGTTCATGTACGATGAGTTTGTGGAGGACTATGA 
GCCTACCAAAGCAGACAGCTATCGGAAGAAGGTAGTGCTAGATGGGGAGGAAGTCCAGATCGATATCTTA 
GATACAGCTGGGCAGGAGGACTACGCTGCAATTAGAGACAACTACTTCCGAAGTGGGGAGGGGTTCCTCT 
GTGTTTTCTCTATTACAGAAATGGAATCCTTTGCAGCTACAGCTGACTTCAGGGAGCAGATTTTAAGAGT 
AAAAGAAGATGAGAATGTTCC^TTTCTACTGGTTGGTAACAAATCAGATTTAGAAGATAAAAGACAGGTT 

TCTGTAGAAGAGGCAAAAAACAGAGCTGAGCAGTGGAATGT^ 

GAGCTAATGTTGACAAGGTATTTTTTGATTTAATGAGAGAAATTCGAGCGAGAAAGATGGAAGACAGCAA 
AGAAAAGAATGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCTVGAGAAAGATGCTGCATTTTAT^ 

Human RALA mRNA sequence - var5 (public gi: 10439805) (SEQ ID NO: 134) 

AGAATGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCAGAGAAAGATGCTGCATTTTATAATCAAA 

GCCCAAACTCCTTTCTTATCTTGACCATACTAATAAATATAATTTATAAGCATTGCCATTGAAGGCTTA 

TTGACTGAAATTACTTTAACATTTTGGAAATTGTTGTATATCACTAAAAGCATGAATTGGAACTGCAATG 

AAAGTC^AATTTACTTTAAAAAGAAATTAATATGGCTTCACCAAGAAGCAAAGTTCAACT 

TTGCCTACATTTATCATGGTCCTGAATGTAGCGTGTAAGCTTGTGTTTCTTGGGCAGTCTTTCTTGAAAT 

TGAAGAGGTGAAATGGGGGTGGGGAGTGGGAGGAAAGGTGACTTCCTCTGGTGTTTATTATAAAGCTTAA 

ATTTTATATCATTTTAAAATGTCTTGGTCTTCTACTGCCTTGAAAAATGACAATTGTGAACATGATAGTT 

AAACTACCACTTTTTTTAACCATTATTATGCAAAATTTAGAAGAAAAGTTATTGGGATGGTTGTTGG^ 

TAGTTAAACTGAGAGTAATTCATCTGTGAATCTGCTTTAATTACCTGGTGAGTAACTTAGAAAAGTGGTG 

TAAACTTGTACATGGAATTTTTTGAA^ 
. TGGGTGTGTTCTTACTGACACCAGGGGTCCGCTGCCCCATGTGTCCTGGTGAGAAAATATATGCCTGGCA 

CAGCTTTTGTATAGAAAATTCTTGAGAAGTAACTGTCCGCTAGAAGTCTGTCCAAATTTAAAATGTGTGC 

CATATTCTGGTTCTTGAAAATAAGATTCCAGAGCTCTTTGATCGCTTTTAATAAACTGCAAGTTCATTTT 

AATTGAAGGGCC7VGCATATATACTTGCAAGATAATTTTCAGCTGCAAGGATTCAGCACCAGTTATGTTTG 

AATGAACCOTCCTTTTCTCTGAGATTCTGGTCCCTGGAAATCCCTTTCTGCTAGTGGTGAGC^TGTAAG^ 

GTTAAGTTTTTAATCTGGGAGCAGGGCATAGGAAGAAAATGTCAGTAGTGCTAATGCATTTTGCACTAGA 

ACGOTTCGGGAA ^ TATT( ^ TGCTTO 

ATACAGGAATTATTAGGAGTAATTCTTTTCTGTTTCTGTTTATAATGAAGAACACTGTAGCTACATTTTC 
AGAAGTTAACATCAAGCGATGAAACCTGGGTA 

GCTGTGTCACCATTGTGTGGTGTACCTGCTGGAAGAATTCTAGCATGCTACTTGGGGACATAATTTCAGT 

GGGAAATATGCCACTGACCGATTTTTTTTTTTTTCCTC^TTGCAGTGGGGCTAGGACAG 

AAGTATTTTTTTCTTTTTTCTCAGTCCTAATTTG 

AGGTGTGTATGTAAAGTTAAAAATAGGCTTTT^^ 

ATGTTAAATATTTCTCCTTAAAGGGTTTGA 

ATGTGCGGAATTCAAGTTACCAATGTAACACTGGCCAGC^ 

AGTCTTATTTAACCAGGGGTCCTAACCACTAACATTGTGACTTTGCTTTGAGACCTTTCCTCTCCTGGGT 

ACTGAGGTGCTATGAAGCCAACTGACAAAGATGCATCACGTGTCTTAGGCTGATGCCACTACCC 

TTTATTTGCAATTTGAGCCATTTAAAGACCAATAAACTTC 

AAAAA 

Human RALA Protein sequence - varl (public gi: 35846) (SEQ ID NO: 269) 



Figure 36 part - 74 



P C TV USD "■»■ /" O B 3 O 8 

Human RALA Protein sequence - var2 (public gi: 20147713) (SEQ ^ NO: 270) 

I^YLANKPKGQNSIALHKVIMVGSGGVGKSALTWFMYDEFVEDYEPTKADSYRKK^ 
OTAGQEDYAAIRDNYFRSGEGFLWFSITEMESFAATADFREQILRVKEDENVPFLLVG^DLED^QV 

SVEEAKNRAEQWNVNYVETSAKTRANVDKVFFDIiMREIRARKMEDSKEKNGKKKRKSI^^ 
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Unicyene Name : SIAH1 Unicjene ID : Hs . 295923 Clone ID: 3GD_15 0 
Human SIAH1 mRNA sequence - varl (public gi: 27503513) (SEQ ID NO: 135) 

CCAGCGCGTCGCCCCCTGCATCCOT^ 
aaaACAAACTTGKjGAGACAGACGAAA 
TGGAGATGCAGTCGAATTTGAGCTCACAGGGAGGTG^ 

TCCACCTCTGTAACTGGTGCTTCTGAGAAGTAAATGGTATTTGGATCCTGACCTCAGACGTGAATTTGGG 
TCTTCTGTGCTTAGGAGCAGAAAGAGCCCAGGAGGGGCCTGTTCCTTTACTTCTTGGGGGAAACGCAATG 
CGTGGCCTGACTTCTCATGACGGGAAAGGCTACTCCACCTTCTCTGTACTCCTGGAGGGGAGTCTTGTTC 

ACAT6TTTACCAGOGGCCAQGACAAGGAAGAGAAAAGA 
CCGGTACCTCGAAGTGTCCACCATCCCAGAGGGTGCCTGCCCT 

CTTGGCGAGTCTTTTTGAGTGTCCAGTCTGCTTTGA^TATGTGTTACCGCCCATTCTO 
GGCCATCTTGTTTGTAGCAACTGTCGCCCAAAGCTCAC^TGTTGTCCAACTTGCCGGGGCCCT 

CC^TTCGCAACTTGC^TATGGAGAAAGTC^ 

ATGTGAAATAACTCTGCGAGACACAGAAAAAGCAGACCATGAAGAGCTCTGTG 

TGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTCTGGATGCTGTAATGCCCCATCTGATGCATC 

AGCATAAGTCCATTACAACCCTAC^^ 
TGCTGTTGACTGGGTGATGATGCAGTCCTGTTTTGGCTTT 

AAATACGATGGTCACCAGCAGTTCTTCGCAATCGTACAGCTGATAGGAACACGCAAGCAAGCTGAAAATT 
TTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGATTGACTTGGGAAGCGACTCCTCGATCTATTCA 

TGAAGGAATTGCAACAGCCATTATGAATAGCGACT^ 

GCAGAAAATGGCAATTTAGGCATCAATGTAACTATTTCCATGTGTTGAAATGGCAATCAAACATTTTCTG 
GCCAGTGTTTAAAACTTCAGTTTCACAGAA^ 

GTAGGTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTTAAATACAGGAAACAG 

AGTAACACTAATATATTTAAAAATAAGTCAACAGTAAAC 

ATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATO 

ATTGTATTGTTGAAAAAGTTTGTTTTTC 

TTTTTCCTTTAACTGACAAGCCATCTTGAG 

TAGTGCTGCTGTGTGCTTTTTTTGTGTGTATTTGCTAATTTTTATTAATTTTAGTTTTTCATTAAATAAA 

TTTGACTTTTCTGTAATTCAGGTTTTTCCT 

TATTTGTTTATGGTAAAAAATTTATAACGTGTTCAATAT^ 

AATATTTTAAAATCAGCTATTT^ 

CAAAAGCTATTTAAAGCATCTATAAGGTGCTCTCTTTCTGTCTTCTACAGATGAGTCACACCTTTGAGCT 
TAATCTTTGAAAGGTTAGAGAATAAATTGATTTTTATAAATACTGCAAATC^GGCTTTTGTTTCCTTTTT 
CAGATATCTTGGACAAATCACATAT TTTAAAATTTGTT CTTGTATTTATTGGTTTTGCAGAAGAAGGCAT 
CGTCATGGACAGTATTTGTAATTAAAAGCAAATCAT^ 
GGTCTTTTATAATTCTCATTAAAAGAATATCTC^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAA 

Human SIAH1 mRNA sequence - var2 (public gi: 4 506946) (SEQ ID NO: 136) 

GCGGCGGCCAGGGGGGAGCCGGGGCGGCCGTTGCGGGGCGCGCTCTCGAGAGGCGGCGGCGGCCAGGGTG 
TCCCGTCGGTCTCGGCGCCGGGAAGAGGCGGTGGCGCTGCCCGCGGTGGCGGGGGTTGGCGACGGAGCGC 
GTTGGTGCCAGGACCGGGGTCCGAGGCGCGCTCTCCGCCCACAGAAATGAGCCGTCAGACTGCTACAGCA 

TTACCTACCGGTACCTCGAAGTGTCCACCATCCCAGAGGGTC 

ACAATGACTTGGCGAGTCTTTTTGAGTGTCCAGTCTGCTTTGACTATGTGTTACCGCC 

TTGGGATCCATTCX3CAACTTGGCTATGGAGAAAGTGGCTAATTCAGTACTTTTCCCCTGTAAATATGCGT 
CTTCTGGATGTGAAATAACTCTGCCACACACAGAAAAAGCAGACCATGAAGAGCTCTGTGAGTTTAGGCC 
TTATTCCTGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTCTGGATGCTGTAATGCCCCATCTG 
ATGCATCAGGATAAGTCCATTACAACCCTACA 

TTCCTGGTGCTGTTGACTGGGTGATGATGCAGTCCTGTTTTGGCTT 
ACAGGAAAAATACGATGGTCACCAGCAGTTCTTCGCAATC 

GAAAATTTTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGATTGACTTGGGAAGCGACTCCTCGAT 

CTATTGATGAAGGAATTGGAACAGCCATTATGAATAGCGAC^ 

GCTTTTTGCAGAAAATQGCAATTTAGGCATCAATGTAACTATTTCCATGTGTTC 

TTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACAGAAAATAAGGCACCCATCTGTCTGCCAACCTA 

TCTTTCGGTAGGTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTTAT^ATACAGGAAACAGT 

TGCATGTAGTAACACTAATATATTTAAAAATAAGTCAACAGTAAACCACTGAAAAAATATATGTA 

ACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATTOT 

TAGATTGATTGTATTGTTGAAAAAGTTTGTTTTTC 

TTGGTTTTTCCTTAACTGACAAGCCATCTTGAGTGGTCATGGGCCACTGCTTTTCCCTTTG 
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ACATAGTGCTGCTGTGTGCTTT^ 

aaatttgacttSctgtaatt 

GCATATTTGTTTATGGTAAAAAATTTATAACGTGTO 
AGAA^^MTTTAAAAT 

AAT^C^GCTATTTAAAGCATCTATAAGGTGCTCT 
- - GCTTAATC TTGAAGGTTAGAGATAAATTGATT1TTATAAATACTC 

^ TATCTTGGACAAATCACATATTTTAAAATTTGTTCTTGTATTTATTGGTTTTG^ 
CGTCATGCACAGTATTTGTAATTAAAAGCAAATCATTTGTTTAAAA^ 



•CAGA' 
"CGTCJ 
GGTCTTTTATAATTCTCA 



Human SIAH1 mRNA sequence - var3 (public gi: wssim) (SEQ ID NO: 137) 

TTTATAATAGCCCTCCAAATGGGTGTGACGTATT 

ATTTTCTAGCATTGATTTTTAAGATGTTACCCAAAGA "* 

TGAAAATTATTAACTCTAGAAATTTTAGTTTA 

ACGTTTTATGCATGGGTTTTTAAAACT^ 

AGAAATCCTACACTGTCCACTGGAGACATCCATGTTTTTACTTGGCTCTGCCCCTTTAGTGGTCCCTGTG 
AACCTTACCTCAAACCATGCATCTGGGGCAGAGATCCTTACT 

TGAAGAATGT(^TCTTTGTGATTGTTCCTGAAATAGTTCACGAGAAATCCATGACCGTAAAGTACTGTGA 

TAGTGATGTCTACCACTGTGAGCTTCCAGTACTAGGTGATTGGTCTGCATTCACAGTGACCAAAATCAGC 

TATGTGGCCAGGTAATTCACTGCTGAGGGCTTTGGATTTTCCTTTATGAACTACTGAAATGAGGTCAACT 

TGACTATTACTAAGGGACATTTTGCTACAAAGAATGTTAGTTTTGCCAATTCCCTTTCCAAATCTAAAAT 

TTATTTTAACCAGGATTTTAGATGTAAACATCAAGTAGTTTTGGTTGTTTCAATGAAGT^ 

GCTCACATTATTTGAAGTACTTCAGTTCCTAOT 

AAAAAAAAGACTACAGTTAGTCATTATCCAATTTGATGATTTATGGTCCAACACTAATGCTCATTTTTTT 

TGTTTTTTTTACAAACATTTGGTGGATACCACAA 
GGAAGGAAATATCAAAAAGGTCTGAATAGACAACAGGCAAATATGCT^ 

GTTGGCAGACAGATGGGTGCCTTATTTTCTGTGAAACTGAAGTTTTAAACACTGGCCAGAAAATGTTTGA 

'ttgccatttcaacacatggaaatagtt^^ 

TGCTGGTGTCAAAGACTAGACAGTCGCTATTCATAATGGCTGT^ 

AGTCGCTTCCC^GTCAATCGTCGCCTATGAC<^TTTAGCTCAAGTCGGTAAGCAAAA 

TTGCGTGTTCCTATCAGCTGTACGATTGCGAAGAACTGCTC 

AGACTAACATGAAGTGAAAGCCAAAACAGGACTGCATCATCACCC^ 

GTCTGTAGCAAGAAAAA.CTATATCCTCTCCCTGTAGGGTTGTAATGGACTTATGCTGATGCATCAGATGG 
GGCATTACAGCATCCAGAGAGCCTTGCCATTTACAGGAAGCACCAGGGCACGGACAGGAATAAGGCCTAA 
ACTCACAGAGCTCTTCATGGTCTGCTTTTTCTGTGTGTGGCAGAGTTATTTCACATCCAGAAGACGCATA 
TTTACAGGGGAAAAGTACTGAATTAGCCACTTTCTCCATAGCCAAGTTGCGAATGGATCCCAAAGGGCCC 
CGGCAAGTTGGACAACATGTGAGCTTTGGGCGACAGTTGCTA 

GAATGGGCGGTAACACATAGTCAAAGCAGACTGGACACTCAAAAAGACTCGCCAAGTCATTGTTGGATGC 

AGTTGTGCCAGTCAGGGCAGGCACCCTCTGGGATGGTGGACACTTCGAGGTACCGGTAGGTAATGCTGTA 

GCAGTCTGACGGCTCATTTCTGAAATAAATACATAAGGAGGCZAGGAGAAAAATAATTATAACCATGACTT 

ACTTTATAAATAATGTTTACATGCGATAAGTCCTTTTAAAGTTTCATACAAA 

GAAGAAAAATAGGATTAAAAAAGATATT 

Human SIAH1 mRNA sequence - var4 (public gi: 21753759) (SEQ ID NO: 138) 

TTTCACCCCCAAGACAAATAGTGGCCTGCC^^ 
TTTATCTTTGACTCAGCCTGGCTAGT^^ 
AATGTTTATTACTCTGCTGATGAATG^ 
CCTGTTGACTCfcATCCCCCACCCCCAGTATGGCAT^ 

TTTTCATTGTCTCTTCCTGACTGTCCTCCTTTGTTCATTATGTCTGAGACATATTGTGGATTGAAAGTAG 

AACAGAAAGATATACTTTCTCTACCAGACTAAAAAGTTTTGAGATGGCCCTCCATTTCTCCCATG 

CTTCACCTTAGTTGTGTTTTTATTTATTTTTAT^ 

TCTTGAGGTGGGCACTGATCAGTAGGAAATAAGATTAATACCTGGCTGGTGATAATTTGGGGGGAAGACT 
TAATTAGATAGAGATGGATAATGGGATGGCAGCAGACCTT^ 

AATACACCTCTAGAGTAGATAATTG<^ACCATTAAGAAGAGTTAATGGAAGGTGATACT 

GGCATTGGAACTACATTCAATCCGCGGTATCCTCGGATTAGTTCTAGGACCCCCTTCTCCATACCAAAAC 

CTGAGGATGCTCAAGTCCCTGATAGAAAATGGTGTCATATTTGTA^ 

AGTGATCTCTGGATTACTTAATACAATGTAAACAATATGTAAATAGTTGTTATAGACTGTATTTTAAA 

TTTTGTATTCTTTATAAATTTTTCTGAATATTTTCAATCCATGGCTGGTGAAGT 

TGTGGATACAGAGTGCCGATTTTATACAGGAGTTGA£ 

CAAATTAGAAAGAAATAGAGTAAGGGAGCCTCAGGGAGAGTCCTAGCAAAACGGATTCGATTAAACTTCA 

GTTCCTTGTATAGrTTCTTTAGTTGTTTATGGTCCATTTTCTATTTTAGCATTTA 

TATCCAAAGACGATTAAGGGAGTTCCACATGTTTTCCGGAACATTTTGAAAAGAGAGCTTATCCAGTG 
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CAGATCCTAATAAAGTGCACATTC^GTGTAATTTTATTTTTTTAATATCTTT 
CCTCTTTTGCTCAGTA 

TAAGTCATGGTTATAATTATTTTTCTCCTGCCTCCTTATGTATT^ 
TCCMCCAA^ 

CTTCAATGTCAGAGTGGCCATCTTGTTTGTAGCAACTC 

GGGGCCCTTTGGGATCCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATTCAGTACTTTTGCCCTGTAA 

ATATGCGTCTTCTGGATGTGAAATAACTCTGCCACACACAGAAAAAGCAGACCATGAAGAGCTCTGTGAG 

TTTAGGCCTTATTCCTGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTCTGGATGCTGTAATGC 

CC(^TCTGATGCATCAGCATAAGTCC^TTACAACCCTACAGGGAGAGGATATAGTTT^ 

CATTAATCTTCCTGGTGCTGTTGACTGGGTGATGATGCAGTCCTGTTTTGGCTTTC^COT 

TTAGAGAAACAGGAAAAATACGATGGTCACCAGCAGTTCTTCGGAATCGTACAGCTGATAGGAAC^CGC^ 

AGCAAGCTGAAAATTTTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGAXTGACTTGGGAAGCGAC 

TCCTCGATCTATTCATGAAGGAATTGCAACAGCCATTATGAATAGCGACTGTCTAGTCTTTGCCACCAGC 

ATTGCACAGCTTTTTGCAGAAAATGGCAATTTAGGCATCAATGTAACTATTTCC^TGTGTTGAAATGGC^ 

ATCAAACATTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACAGAAAATAAGGCACCCATCTGTCTGCCAA 

CCTAAAACTCTTTCGGTAGGTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTTAAATACAG 

GAAACAGTTGCATGTAGTAACACTAATATATTTAAAAATAAGTCAACAGTATUlCCACTGAAAAAATATAT 

GTATATACACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATTCTGAGTTTTGT 

GTTTGTTGTAGATTGATTGTATTGTTGAAAAAGTTTGTTTTTGCGTGGGAGTGTGTGCCTGCGTGGGTGT 

GTGCX5TGTTTGGGTTTTTTTCCTTTAACTGACAAGCCATCTTGAGTGGTCATGGGCCACTGCTTTTCC 

TTGTGAGTGAATACATAGTGCTGCTGTGTGCTTTTTTTGTGTGTATTTGCTAATTTTTATTAATTTTAGT 

TTTTCATTAAATAAATTTGACTTTTCTGT 

Human SIAH1 mRNA sequence - var5 (public gi: 3041824) (SEQ 3D NO: 139) 

ATGAGCCGTCAGACTGCTACAGCATTACCTACCGGTACCTCGAAGTGTCCACCATCCCAGAGGGTGCCTG 
CCCTGACTGGCACAACTGCATCCAACAATGACTTGGCGAGTCTTTTTGAGTGTCCAGTCTGCTTTGACTA 
TGTGTTACCGCCCATTCTTCAATGTCAGAGTGGCCATCTTGTTTGTAGCAACTGTCGCCCAAAGCTCACA 
TGTTGTCCAACTTGCCGGGGCCCTTTGGGATCCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATTCAG 
TACTTTTCCCCTGTAAATATGCGTCTTCTGGATGTGAAATAACTCTGCCACACACAGAAAAAGGAGACCA 
TGAAGAGCTCTGTGAGTTTAGGCCTTATTCCTGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCT 
CTGGATGCTGTAATGCCCCATCTGATGCATCAGCATAAGTCCATTACAACCCTACAGGGAGAGGATATAG 

TTTTTCTTGCTACAGACATTAATCTTCCTGGTGC 

TCACTTCATGTTAGTCTTAGAGAAACAGGAAAAATACGATGGTCACCAGCAGTTCTTCGCAATCGTACAG 
CTGATAGGAACACGCAAGCAAGCTGAAAATTTTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGAT 
TGACTTGGGAAGCGACTCCTCGATCTATTCATGAAGGAATTGCAACAGCCATTATGAATAGCGACTGTCT 
AGTCTTTGACACCAGCATTGCACAGCTTTTTGCAGAAAATGGCAATTTAGGCATCAATGTAACTATTTCC 
ATGTGTTGAAATGGCAATC^AACATTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACAGAAAATAAGGCA 
CCGATCTGTCTGCCAACCTAAAACTCTTTCGGTAGGTAGAAGCTAGACACATGAAGGTAAATAAAAAGAA 
AGGCTGTTAAATACAGGAAAGAGTTCCATGTAGTAACACTAA^ 

CAOTGAAAAAATATATGTATATACACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAA 



GTGCCTGCGTGGGTGTGTGCGTGTTTGGGTTTTTTTCCTTTAACTGACAAGCCATCTTGAGTGGTCATGG 
GCCACTGCTTTTCCCTTTGTGAGTCAATACATAGTGCTGCTGTAAGCCGTTTTTGTGTGTATTTGCTAAT 



CCATTTTAAAGTTAGTATCTTTTGATATGGCATATTTGTTTATGGTAAAAAATTTATAACGGGT^ 

TTTTCTTTTCCCCCATTAATCAAGTCCATTGGAAATATTTTAAAACCAGCCTATTTTGGTGAACCCATGA 

GTTCCCAGAAAGTAAAGGTGACACCCGGAAAAATAATCCAAAAGCCTATTTAAAGCCACCTATAAGGTGC 

CCCCCTTTCCTGTCTTCCTACAGATGAGTCACACCTTTGAGCCTTAACCTTTGAAAGGTTAGAGAATAAA 

TTGATTTTTATAAATACTGCAAATCCAGGCTTTTGTTTCCTTTTTCCAGATATCCTTGGACAAATCACAT 

ATTTTAAAATTTGTTCTTGTATTTATTGGTTTTGC^GAAGAAGGCATCGTCATGCACAGTATTTGTAATT 

AAAAGCAAATTCATTTGTTTAAAAAGGCAGTTTGCAAAAAATGTTTTTGGTCTTTTATAATTCTCA 

Human SIAH1 mRNA sequence - var6 (pubKc gi: 17390431) (SEQ ID NO: 140) 

CGGCGCCGGG7UIGAGGCGGTGGCGCTGCCCGCGGTGGCGGGGGTTGGCGACGGAGCGCGTTGGTGCCAGG 
ACCGGGGTCCGAGGCGCGCTCTCCGCCCACAGAAATGAGCCGTCAGACTGCTACAGCATTACC?ACCGGT 
ACCTCGAAGTGTCCAC(^TCCCAGAGGGTGCCTGCCCTGACTGGCAC^CTGCATCCAACAATGACTTGG 
CGAGTCTTTTTGAGTGTCCAGTCTGCTTTGACTATGTGTTACCGCCCATTCTTCl^ATGTCAGAGTGGCCA 

TCTTGTTTGTAGCAACTGTCGCCCAAAGCTCACATGTTGTC 

CGCAACTTGGCTATGGAGAAAGTCGCTAATTCAGTACTTTTCCCCT 

AAATAACTCTGCCACACACAGAAAAAGGAGACGATGAAGAGCTCTO 

GTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTCTGGATGCTGTAATGCCCCATCTGATGCATCAGCAT 



! 
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AAGTCCATTACAACCCTACAGGGAGAGG^ 

TTGACTGGGTGATGATGCAGTCCTGTTTTGGCTTTCACTTCATGTTAGTOTTA 
CGATGGTCACCAGCAGTTCTTCGCAATCGTAGAGCTGATAGGAACACGCAAGCAA 

TACCGACTTGAGCTAAATGGTCATAGGCGACGATTGACTTGGGAAGCGACTCCTCGATCTATTCATGAAG 
GAATTGCAACAGCCATTATGAATAGCGACTGTCTAGTCTO 
. AAAATGGCAATTTAGGCATCAATGTAACTATTO 

GTGTTTAAAACTTC^GTTTCACAGAAAATAAGGCACCCATCTGTCTGCCAACCTAAAACTCTTTCGGTAG 
GTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTTAAATACAGGAAACAGTTGCATGTAGTA 
ACACTAATATATTTAAAAATAAGTCAACAGTAAACCACTGAAAAAATATATGTATATACACCCAAGATGG 
GCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAACTCTGAGTTTT 

TCCTOTAACTGACAAGCCATCTTGAGTGGTCATC 

GCTGCTGTGTGCTTTTTTTGTGTGTATTTGCTAATTTTTATTAAT^ 

ACTTTTCTGTAATTCAGGTTTTTCCTTTTTTTGTACCATTTTAAAGTTAGTAT 

TGTTTATGGTAAAAAATTTATAACGTGTTCAATATTTTCTTTTCCCCCATTAATCAGTTCAT 

TTTTAAAATCAGCrATTTTGTGAAGCCM 

AGCTATTTAAAGCATCTATAAGGTGCTCTCTTTCTGTCTTCTACAGATGAGTGACACCTTTGAGCTTAAT 
CTTTGAAAGGTTAGAGAATAAATTGATTTTTATAAATACTGC^^ 

TATCTTGGACAAATGACATATTTTAAAATTTGTTCTTGTATTTATTGGTTTTGCAGAAGAAGGCATCGTC 

ATGCACAGTATTTGTAATTAAAAGCAAATCATTTC 

TTTTATAATTCTCATTAAAAGAATATCT^^ 

AAAA 

Human S1AH1 mRNA sequence - var7 (public gi: 23274141) (SEQ ID NO: 141) 

GTCCCGTCGGTCTCGGCGCCGGGAAGAGGCGGTGGCGCTGCCCGCGGTGGCGGGGGTTGGCGACGGAGCG 
CGTTGGTGCCAGGACCGGGGTCCGAGGCGCGCTCTCCGCCCACAGAAATGAGCCGTCAGACTGCTACAGC 
^TTACCTACCGGTACCTCGAAGTGTCCACCATCCCAGAGGGTGCCTGCCCTGACTGGCACAACTGCATCC 
AACAATGACTXGGCGAGTCTTTTTGAGTGTCC^GTCTGCTTTGACTATGTGTTACCGCCCATTCTTCAAT 
GTCAGAGTGGCCATCTTGTTTGTAGCAACTGTCGC^ 

TTTGGGAT CCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATT CAGTACTTTT CCCCTGTAAATATGCG 

TCTTCTGGATGTGAAATAACTCTGCCAGACAGAGAAAAAGCAGACCATGAAGAGCT 

CTTATTCCTGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCT 

GATGCATCAGCATAAGTCCATTACAACCCTACAGGGAGAGGATATAGTTTTTCTTGCTACAGACA.TTAAT 
CTTCCTGGTGCTGTTGACTGGGTGATGATGCTVGTCCTGT^ 

AACAGGAAAAATACGATGGTCACCAGCAGTTC^TCGCTVATCGTACAGCTGATAGGAAC^ 
TGAAAATTTTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGATTGACTTGGGAAGCGACTCCTCGA 
TCTATTCATGAAGGAATTGCAACAGCCATTATGAATAGCGACTGTCTAGTCTTTGACACCAGCATTGCAC 
AGCTTTTTGGAGAAAATGGCAATTTAGGCATCAATGra 

ATTTTCTGGCCAGTGTTTAAAACTT(^GTTTCACAGAAAATAAGGCACCCATCTGTCTGCCAACCTAAAA 
CTCTTTCGGTAGGTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTTAAATACAGGAAACAG 
TTGCATGTAGTAACACTTAATATATTTAAAAATAAGTCAACAGTAAACCACTGAAAAAATATATGTATATA 
CACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATTCTGAGTTTTGTGTTTGTT 
GTAGATTGATTGTATTGTTGAAAAAGTTTGTTTTTGCGTGGGAGTGXGTGCCTGCGTGGGTGTGTGCGTG 
TTTGGGTTTTTTTCCTTTAACTGACAAGCCATCTTGAGTGGTCATGGGCCACTGCTTTTCCCTTTGTGAG 
TC^UVTACATAGTGCTGCTGTGTGCTTTTTTTGTGTGTATTTGCTAATTTTTATTAATTTTAGTTTTTCAT 
TAAATAAATTTGACTTTTCTGTAAAAAAAAAAAAAAAAAAAAAAAA^^ 

Human SIAH1 Protein sequence - varl (public gi: 27503514) (SEQ ID NO: 271) 

MTGKATPPSLYSWRGVLFTCLPAART^KRKEMSRQTATALPTG 

ECPVCFDYVLPPII^CQSGHLVCSNraPKLTCCPTC 

PHTEKADHEELCEFRPYSCPCPGASCKWQGSIiDAVMPHIjMHQ 

MMQS CFGFHFMLVLEKQE KYDGHQQ FFAI VQL I GTRKQAENFAYRLELNGHRRRLTWEATPRS IHEG I AT 
AIMNSDCLVFDTSIAQLFAENGNLGINVTISMC 

Human S1AH1 Protein sequence - var2 (public gi: 4506947) (SEQ ID NO: 272) 

MSRQTATALPTGTSKCPPSQRVPALTGTTASNOT 

CCPTCRGPLGSIRNIiAI^KVANSVLFPCKYASSGCEITLPHTEKADHEELCEFRPYSCPCPGAS 
IiDAVMPHLMHQHKSITTLQGEDIVFLATDINLPGAVDWVMMQSCFG 

LI GTRKQAENFAYRIiELNGHRRRLTWEATPRS IHEGI ATAI MNSDCDVFDTS I AQLFAENGNLGINVTIS 
MC 
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Uniqene Name ; SMH1 Pnigene ID : Hs. 2 8898 6 Clone ID= GD_1114 
Human SMN1 mRNA sequence - varl (public gi: 624185) (SEQ ID NO: 142) 

~CGGGGCCCCACGCTGCGCATCCGCGGGTTTGCTATGGCGATGA6CAGCGGCGGCAGTGGTGGCGGCGTCC 
CGGAGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGACATTTGGGATGA 
^^A^GCACTGATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATT 
TGTGAAACTTCGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGA 
AGAATACTGCAGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGG 
TTGCATTTACCCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACACTGGA 
TATGGAAATAGAGAGGAGCAAAATCTGTCTOATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAG 
AACAGAATGCTCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCC — 
TGGAAATAAATCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTCCACCACCCCCC 
ATGCCAGGGCCAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCCACCACCGCCACCGCCAC 
CAC(^CACCCCACTTACTATC^TGCTGGCTGCCTCCATTTCCTTCTGGACCACCAATAATTCCCCCACC 
ACCTCCCATATGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACATG 
AGTGGCTATCATACTGGCTATTATATGGGTTTCAGACAAAATCAAAAAGAAGGAAGGTGCTCACATTCCT 
TAAATTAAGGAGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGACAGATCT 
GGAATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAA 
GGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTAC 
TGGACTCTTTTGAAAAACCATCTGTAAAAGACTGGGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAG 
TTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCTGTG 
AGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAG 
AGTGTCTTAAAATGTTTCy^AATGGTTTAACAAAATGTATGTGAGGCGTATGTGGCAAAATGTTACAGAAT 
CTAACTGGTGGACATGGCTGTTCATTGTACTGTTTTTTTCTATCTTCTATATGTTTAAAAGTATATAATA 

AAAATATTTAATTTTTTTTTA 

Human SMTNfl mRNA sequence - var2 (public gi: 15929773) (seq id no: 143 

GGCCCCACGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGG 
AGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGACATTTGGGATGATAC 
AGCACTGATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGT 
GAAACTTCGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGA 

ATACTGC^GCTTCCTTACAACAGTGGAAAGTTGGGGA 
CATTTACCCAGCTACCATTGCTTCAATTGATTTTAASAGAGAAACCT 

GGAAATAGAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAAC 
AGAATGCTCAAGAGAATGAAAATGAAAGCC^UVGTTTCAACAGATGA^AGTGAGAACTCCAGGTCTCCTGG 

AAATAAATC^GATAACATCAAGCCCAAATCTGCT^ 

CCAGGGCCAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCCACCACCGCCACCGCCACCAC 
CACCACCCCACTTACTATCATGCTGGCTGCCTCCATTTCCT 

TCCCATATGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGT 
GGCTATCATACTGGCTATTATATGGGTTTTAGACAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAA 
ATTAAGGAGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGACAGATCTGGA 
ATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATC7AGTGTTGGGAAAGAAAAAAGGA 
AGTGGAATGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACTGG 

ACTCTATTTTGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTG^ 

TGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGAGCTGT 
GAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGT^ATGAAGTGTTAGA 

GTGTCTTAAAATGTTTCAAATGGTTTAACAAAA^ 

TAACTGGTGGACATGGCTOTTCATTGTACTGTTTTTTTCTATCTTCT 

AAATATTTAATTTTTTTTTAAAAAAT^AAAAAAAAAAAAAA 

Human SMN1 mUNA sequence - var3 (public gi: 13259511) (SEQ ID NO: 144) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCA 
CGCTGCG(^CCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 
GGATTCCX3TGCTX3TTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGAC^ 

ATAAAAGCATATGATAAAG CTGTGG CTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 

CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAG^ 

AGCTTCCTTACAAGAGTGGAAAGTTGGGGACAAATGTTCT 

CCAGCTACC^TTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACACTGGATATGGAA^ 
GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTC 
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TCAAGAGAATGAAAATGAAAGC 

TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCT 
CAAGACTGGGACCAGGAAAGATAATTCCCCCA.CCACCTCCCATATGTCCAGATTOT 

TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATACTGGCTATTATATGGGTTTCAGA 
CAAAATC^AAAAGAAGGAAGGTGCTCACATTCCTTAAATTAAGGAGAAATGCTGGCATAGAG 
AATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCAT 
TTCTTCAAAATATCAAGTGTTGGGT^AAGAAAAAAGGAAGTGGAATGGGTAACTCT^ 

ATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTTTTGAAAAACCATCTGTAAAAGACTGGG 
GTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGA 
TATTGGATAATTATTGGTAATTTTATGGCCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAAT 
TTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTT^ 

TATGTGAGGCGTATGTGGCAAAATGTTAC^GAATCrrAACTGGTGGACATGGCTGTTCATTGTACTGTTTT 
TTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTAATTTTTTTTTA 

Human SMN1 mRNA sequence - var4 (public gi: X3iii8i7) (SEQ ID NO: 145) 

GGGGCCCCACGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCC 

GGAGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGAGATTTGGGATGAT 

ACAGCACTGATAAAAGGATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACZATTT 

GTGAAACTTCGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAA 

GAATACTGCAGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCGATTTGGTCAGAAGACGG 

TGCATTTACCCAGCTACC^TTGCTTGAATTGATTTTAAGA 

ATGGAAATAGAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGA 

ACAGAATGCTCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCT 

GGAAATAAATCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTCCACCACCCCCCA 

TGCCAGGGCCT^GACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCCACCACCGCCACCGCCACC 

ACCACCACCCCACTTACTATGATGCTGGCTGCCTCCATTTCCT^ 

CCTCCCATATGTCC^GAaTCTCTTGAT^ 

GTGGCTATCATACTGGCTATTATATGGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAA 
CGATCAGAGAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTGATTTCTTCAAAATATCAAG^ 
TTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTA7VAAGTTATGTAATAACCAAATGCAA 
TGTGAAATATTTTACTGGACTCTATTTTGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCA 
GCACGGTGGTGAGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGG 
TAATTTTATGAGCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATT 
TAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGTATGTGAGGCGTATGTG 
GCAAAATGTTACAGAATCTAACTGGTGGAGATGGCTGTTCATTGTACT 



HumanSMNlmRNAsequen.ee- var5 (public gi: 13259515) (SEQ ID NO: 146) 

CCACAAATGTGGGAGGGCGATAACGACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 

ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCK3GCGGAAGTCGTGACTCTTAAGAAG 

CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 

GGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTG 

ATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 

CGGGTAAACGAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGAAT^ 

AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCA 

CGAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACA 

GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCT 

TCAAGAGAATGAAAATGAAAGCC^UVGTTTCAACAGATGAAAGT 

TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTCCACG^ 

CAAGACTGO^ACCAGGAAAGCCAGGTCTAAAATTCAATC 

CCACTTACTATCATGCrGGCTGCCTCGATTTCCTTCTGGACCACCAATAA 

TGTCCAGATTCTCTTGATGATGCTGATGOTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGOT 

ATACTGGCTATTATATGGGTTTGAGACAAAATCAAAAAGAAG^ 

AGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACG^ 

GGGTTATAGAAGATAACTGGCCT CATTT CTT CAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAA 

tgggtaactcttcttgattaaaagttatgtaataacca 

tgaaaaaccatctgtaaaagactggggtgggggtgggaggccagcacggtggtgaggcagttgagaaaat 
ttgaatgtggattagattttgaatgatattggataattattggtaattttatggcctgtgagaagggtgt 
tgtagtttataaaagactgtcttaatttgcatacttaagcatttaggaatgaagtgttagagtgtctt^ 
aatgtttcaaatggtttaacaaaatgtatg^^ 

GACATGGCTGTTCATTGTACTGTTTTTTTCrATCT 
ATTTTTTTTTA 
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Human SMN1 Protein sequence- varl (public gi: 13259512) (seq id no: 273) 

MAMSSGGSGGGVPEQEDSVLF^GTGQSDDSDIWDDTALIKAYDKAVASFKHALKN^ 
KRKPAKKNKSQKKNTAASLQW 

LLSPICEVAlWIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRIiGPGKI 
IPPPPPICPDSLDDADALGSMDISWyMSGYHTGYYMGFRQNQKEGRCSHSIiKr 

Human SMN1 Protein sequence - var2 (public gi: 1265418D (seq id no: 274) 

MAMS SGGSGGGVPEQEDSVLFRRGTGQSDDSD I WDDTALI KAYDKAVAS FKHALKNGDI CETSGKPKTTP 
KRKP AKKtOCSQKKNTAASIjQQWKVGDKCSAIWSEDGCI YPATI AS IDFKRETCVWYTGYGNREEQNLSD 
IiLSPICTVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRIaGPGKP 
GLK^GPPPPPPPPPPHLLSC^LPPFPSGPPIIPPPPPICPDSIJDDADALGSMLISWYMSGYHTGYY^ 

IiA 

Human SMN1 Protein sequence - var3 (public gi: 4507091) (seq id no: 275) 

MAMS SGGSGGGVPEQEDSVLFRRGTGQSDDSD I WDDTALI KAYDKAVAS FKHALKNGD I CETSGKPKTT 

KRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKRETCVWYTG 

LLS P ICEVANNIEQNAQENENESQVSTDES ENSRSPGNKSDNI KPKSAPWNSFLPPPPPMPGPRLGPGKP 

GLKFNGPPPPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGF 

RQNQKEGRCSHSLN 

Human SMN2 mRNA sequence - varl (public gi: 7364io) (SEQ ID NO: 147) 

GCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCA 
CAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTGATAAAAGCATATGATAAAGCTGTGGC 

TTC^TTTAAGC^TGCTCTAAAGAATGGTGACA^ 

AGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGAATACTGCAGCTT^ 
GGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATTTACCCAGCrACC^ 

TAAGAGAGAAACCTGTGTTGTGGTTTACACTGGATATGGAAATAGAGAGGAGCAAAATCTGTCCGATCTA 

CTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGCTCAAGAGT^ATGAAAATGAAAGCCAAG 

TTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAATCAGATAACATCAAGCCCAAATCTGC 

TCCATGGAACTCTTTTCTCCCTCCACCACCCCCCATGCCAGGGCCAAGACTGGGACCAGGAAAGCCAGGT 

CTAAAATTCAATGGCCCACCACroCCACCGCCACCACCACCACCCCA^ 

CATTTCCTTCTGGACCACCAATAATTCCCCCACCAC^ 

TGCHTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATACTGGCT 

CAAAATGAAAAAGAAGGAAGGTGCTC^CATTCCTTAAATTAAGGAGAAATGCTGGCATAGAGCAGCACT 

AATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCAT 

CTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTT 

ATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTATTTTGAAAAACCATCTGTAAAAGACTG 

AGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGOVGTTGAGAAAATTTGAATGTGGATTAGACTTTGAAT 

GATATTGGATAATTATTGGTAATTTTATGAGCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTA 

ATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGT 

TGTATGTGAGGCGTATGTGGCAAAATGTTACAGAATCTAACTGGTGGACATGGCTGTTCATTGTACTGTT 
TTTTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTAAT^ 

Human SMN2 mRNA sequence - var2 (public gi: 13259530) (seq id no: 148) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 

ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTCGTC^CTCTTAAGAAGGGACGGGGCCCCA 

CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 

GGATTCCGTGCTGTTCCGGCGCGGCAC^GGCCAGAGCGATGATTCTGACATTTGGGATGATACA^ 

ATAAAAG CATATGATAAAGCTGTGG CTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 

CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAA 

AGCTTCCTTAC^CAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATTTAC 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTA^ 

GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGC 
TCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCT * 
TCAGATAACATCAAGCCCAAATCTGCTCCATGGA 

CAAGACTGGGACCAGGAAAGATAATTCCCCCACCACCTCCC^TATGTCCAGATTCTCTTGATGATGCTGA 

TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATAOTGGCTATTATATGGAAATGCT^ 

GCATAGAGCAGC^CTAAATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGA 

AGATAACTGGCCTCATTTCTTCAAAATATGAAGTGTTGGGAAAGAAAAAAGGA 

TTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACT 
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TCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAATTTGAATGTGG 

ATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCTO^ 
AAAAGi^CTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTT 
ATGGTTTAACAAAATGTATGTGAGGCGTATGTGGCAA^TGTTACAGAATCTAACTGGTGGACATGGCTG 
XTCATTGTACTGTTTTTTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATA 

AAA 

-^Human SMN2 mRNA sequence - var3 (public gi: 1325952a) (seq id no: 149) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCT*CCC^GAAGCCCCGGGCGGCGGAAGTCGTC^CT 

CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 

GGATTCCGTGCTGTTCCGGCGCX3GCACAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTG ^ 

ATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 

CGGK3TAAACGAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAAT^ 

AGCTTCCTTAGAACAGTGGAAAGTTGGGGACAAATGl^ 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTO 

GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAA 

TCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAA 
TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACT 

CAAGACTGGGACCAGGAAAGATAATTCCCCC^CCACCTCCCATATGTCCAGATTCTCTTGATGATGCTGA 

TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATACTGGCTATTATATGGGTTTTAGA 

CAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAAATTAAGGAGAAATGCTGGCATAGAGCAGCACTA 

AATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCAT , 

TTCITGAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACT 

ATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTTTTGAAAAACCATCT 

GTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGA 

7ATTGGATAATTATTGGTAATTTTATGGCCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAAT 

TTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATO 

TATGTGAGGCGTATGTGGC?UIAATGTTAC^GAATCTAACTGGTGGACATGGCTGTTCATTGTACTGTTTT 

TTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTAATTTTTTTTTAAA 

Human SMN2 mRNA sequence - var4 (public gi: 13259526) (seq id no: 150) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCX^CGGAAGTCGTCACTCTTAAGAAGGGACGGGGCCC^ 
CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 
GGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTG 
ATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 
CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGAATACTGC 
AGCTT.CCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGC^ 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACACTGGATATGGAAATA 
GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCC^^TCTGTGAAGTAGCTAATAATATAGAACAGAATC 
TCAAGAGAATGAAAATGA^GCCAAGTTTCAAC^GATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAA 
TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCrCCCTCC^ 

CAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCCACCACCGCCACCGCGACCACCA^ 
CCACTTACTATCATGCTGGCTGCCTCC^^ 

TGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACA^ 
ATACTGGCTATTATATGGAAATGCTGGCATAGAGCAGCACTA^ 

AGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAG 

AAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAA 

TTTTACTGGACTCTTTTGAAAAACCATCTGTAAAA 

AGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGG 

CCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAG 

TGrTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGTATGTGAGGCGTATGTGGCAAAATGTTA 

C^GAATOTAACTGGTGGACATGGCTGTTCATTGTACTGTTTTTTTCTATCTTCTATATGT 

ATAATAAAAATATTTAATTTTTTTTTAAA 

Human SMN2 mKNA sequence - var5 (public gi: 13259525) (seq id no: 151) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCX3GAAGTCGTCACTCTTAAGAAGGGACTC 
CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 
GGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGO?ATGACT 

ATAAAAGC^TATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT . 
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CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCT^ 

AGCTTCCTTAC^CAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATTTAC 

CCAGCTACC^TTGCTTC^ATTGATTTTAAGAGAGAAACCT 
GAGAGGAGGAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCT 

TCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAA 
TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTCCA 

CAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCC 
CCACTTACTATCATGCTGGCTGCCTCCATTTCCTTCTGGACCACC^ 

TGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATC 
ATACTGGCTATTATATGGGTTTTAGAOU^ 

AGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAA 
* GCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAA 
TGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTTT 
TGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAAT 
TTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCTGTGAGAAGGGTGT 
TGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTTAA 
AATGTTTCAAATGGTTTAACAAAATGTATGTGAGGCGTATGTGGCAAAATGTTACAGAATCTAACTGGTG 

GACATGGCTGTTC^TTGTACTGTTT^ 
ATTTTTTTTTAAA 

Human SMN2 Protein sequence - varl (public gi: 7364ii) (seq id no: 276) 

AMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDICETSGKPKTTPK 
RKPAKKNKSQKKNTAASI^QWKVGDKCSAIWSEDGCIYPATIASIDFKRETW 

LSP ICEVANNI EQNAQENENESQVSTDESENSRS PGNKSDNI KPKS APWNS FLPPPPPMPGPRLGPGKPG 
liKFNGPPPPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFR 

QNQKEGRCSHSLiN 

Human SMN2 Protein sequence - var2 (public gi: 13259531) (seq id no: 277) 

MAMSSGGSGGGVPEQEDSVIiFRRGTGQSDDSDI WDDTALI KAYDKAVASFKHALKNGD I CETSGKPKTTP 
KRKPAKKNKSQKKOTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR^ 

LLS P I CEVANNI EQNAQENENESQVSTDE S ENSRS PGNKSDNI KP KSAPWNSFLP PPPPMPGPRLGPGKI 
I PPPPPI CPDSLDDADALGSMLISWYMSGYHTGYYMEMLA 

Hwnan SMN2 Protein sequence - var3 (public gi: 13259529) (seq id no: 278) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGD I CETSGKPKTTP 
KRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKRETCV\^ 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRIiGPGKI 
I PPPPPI CPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN 

Human SMN2 Protein. sequence - var4 (public gi: 13259527) (seq id no: 279) 

mamssggsgggvpeqedsvlfrrgtgqsddsdiwddtalikaydkavasf 

krkpakknksqkkntaaslqqwkvgdkcsaiwsedgciypatiasidfkretcvwyto 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRIiGPGK^ 
GLKFNGPPPPPPPPPPHLLS(>?IjPPFPSGPPriPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMEM 

LA 

Human SMN2 Protein sequence - var5 (public gi: 10937869) (seq id no: 280) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDI WDDTALI KAYDKAVAS FKHALKNGD I CETSGKPKTTP 
KRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIAS 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRLGPGKP 
GLKFNGPPPPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGF 
RQNQKEGRCSHSLN 



Unigene Name: SNX1 Unigene ID: Hs. 498154 

Human SNX1 mKN A sequence - vari (public gi: 3152939) (SEQ ID NO: 152) 

ATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCGGCCT^ 

AGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGGGGAGGACATTTT 

CACCGGCGCCGCGGTGGTCAGTAAACATC7VGTCTCCAAAGATAACT 
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GGCTCCAAAGAAAATGGGATCCAT^ 

TGGAGCTATCCTTGGACAGCACACAAAATAATCAGAAGAAGGTGCTAGCCAA^ 
TCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGC^ 

GATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATAGGGGATGGTATGAATGCATATGTAG 
CCTACAAAGTTACAACACAGACAAGCTTACCATTGTTCAGAAGCAAACAGTTTGCAGTAAAAAGAAGATT 

TAGTGACTTTCTGGGTCTTTATGAGAAGCT^^ 

TCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTTCTTCTGCAGAA 
TTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTGTAAATCATCCTACCATGTTACA 

GGACCCTGACGTC^GAGAGTTCTTGGAAAAAGAA^ 
GGTGCTGGTCTCCTCAAGATGTTC^^ 

AATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGCGCTTACGGAAACT 
TGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAACACAGCCCAGTTTGCAAAGAGT 

CTAGCCATGCraGGGAGCTCTGAGGACAACACMG 
AAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGAC 

TGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAGACATGGCAGCGCTGG 
CAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGGCTGCTGTGGG 
ATAAGCTGCAGCAGGC CAAGGACGAGAT CCTCGAGTGGGAGT CTCGGGTGACTCAATATGAAAGGGACTT 
CGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATAOGGTTTGAGAAAGAGAA 

AACCACGTGATCAAGTACCTTGAGACACTCCTTTACTCACAGCAGCAGCTGGCAAAGTACTGGGAAGCCT 
TCCTTCCTGAGGCAAAGGCCATCTCCTAA 

Human SNX1 mRNA sequence - var2 (public gi: 3152941) (SEQ ID NO: 153) 

ATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCGGCCTGGAGCCGG 

AGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGGGGAGGACATTTT 

CACCGGCGCCX3CGGTGGTCAGTAAACATCAGTCTCGAAAGAT^ 

GGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCA 

TGAATGCATATGTAGCCTACAAAGTTACAACAC^ 

AGTAAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCTCAGAATGGC 
TTCATTGTCCCTCt^TCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATT 
^rvr^rrrvr^nn an^a TTTPTTr.aii a a a nr:riar;r:nrrnPTTTAGAAAGGTACCTTC AGAGGATTGTAAATCA 



TCCTACCATGTTACAGGACCCTGACGTC^GAGAGTTCTTGGAAAAAGAAGAGCTGCCAUUT^CL.^ l^k* j. 
ACCCAGAGATTGAGTGGTGCTGGTCTCCTC^^GATGCT 

CCATCAAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGCG 
CTTACGGAAACTGCATGCTGTTGTAGAAACTCTAGTC^CCTVTAGGAAAGAGCTAGCGCTGAAC^CAGCC 
CAGTTTGCAAAGAGTCTAGCGATGCTTGGGAGCTCTGA& 

AGCTGGCTGAGGTGGAAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGACTTCTTCCTCCT 
TGCTGAGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAG 
ACATGGC^GCGCTGGCAGGATGCCCAAGCCACACTGCAGAAGA 

GGGCCAACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCTCGGGTGACTCA 
ATATGAAAGGGACTTCGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATACGGTTTGAGAAAGAGAAA 

TCGAAGGACTTGAAGAACC^CGTGATCAAGT^ 
AGTACTGGGAAGCCTTCCTTCCTGAGGCAAAGGCCATCTCCTAA 



Human SNX1 mRNA sequence - var3 (public gi: 30582804) (SEQ ID NO: 154) 

ATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCGGCCTGGAGCCGG 

AGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGGGGAGGACATTTT 

CACCGGCGCCGCGGTGGTC^GTAAACATCAGTCTCCAAAGATAACTACATCCCTTCTTCCCATC^ 

GGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTGCAGATGCC^ 

TGGAGCTATCCTTGGAC^CACACAAAATAAT^ 

TCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCCAACCT 

GATCAATTTGATTTGAGAGTCGGTATAACTGATCCTGAGAAGATAGGGGATGGTATGAATGCATATGTAG 

CCTACAAAGTTACAACACAGACAAGCTTACCATTGTTCAGAAGCA^ 

TAGTGACTTTCTGGGTCTTTATGAGAAGC 

CCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTTCTTCTGCAGAAT 

TTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATT^ 

GGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCraCCACGTGCCGTGGG 

GGTGCTGGTCTCCTCAAGATGTTCAACAAAGCGACAGAT^ 

i^TCAGAGATTTGGTTTGAGGAGAAGCTCCAGGAGGTA^^ 

TGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAAC^ 

CTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTGTCACGGGCACTCTCCCAGCTGGCTGAGGTGG 
AAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGACT 

TGACTAC^TTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAGACATGGCAGCGCTG 
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CAGGATGCCCAAGCCAC^CTGCAGAAGAAGCGGGAGGCCGAGGCTCGGOT 

ATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCTCGGGTGACTCAATATGAAAGGGACTT 
CGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATACGGTTTGAGAAAGAGAAATCCAAGGACTTCAAG 

AACCA&GTGATCAAGTACCTTGAGACACTCCTOT 
TCCTTCCTGAGGCAAAGGCCATCTCCTAG 

Human SNX1 raKNA sequence - var4 (public gi: 4884359) (SEQ ID NO: 155) 

-"•^GGTTGCTTTGTTAkGTTCCATCTAATG 
TGTCTCCTCTGGTTTCAGAACTTCCT^ 

AGGGCTTGTGCATTTTTGTAAAGAGCTTGCACGTGTGGAAATCAAGTAGGCCAGTAGTGGGTTAGGGG 
CTGAGCCAGAAGCCTCTAGAAGGAATAACAGGAGCAC^ 

AGGAGGGAGGACTTTGTGGAGAACCTGATGCTTGAACTGAGTCTAAAAGGTGTAAAAGTGTTTGTTGCTT 
CTGCCTCCCTGTCTGTCTGGCAGGGTGAGGTAGGCGCATCTAGGGAAATGTCAAGTGGCTTGGTGTAGGG 
TAAAGTCAGTGAGGCCCATGGAGAAAAACGAGCAGGAGCCACATCAC^ 
AGGCGCTTTCGACATTACCATTGTCGCTTCGTGATCTGGA 

AAGAGCAGCCAGGCTTATCCCTACCCTCAGGAGAGCrGAAAAGGGCAGGTATGGTGGGGCCAGAGCTCAG 

GAGAGTTTCGGAACCACTGAGATCGGTCCTTGATTTGATGAGAGGCTTGAGGGGAGAGGGAGGTAGCTAG 

GATGCCCCGCAAGCTTCTGGCCCAGACACTGGGCAGACAATGAAACCCTTTGTAACACATGAGG 

GTTTGGGGCAGATGGGAGGGGAAGCAGTGGTGGGGGCAGTGAGTGOTGAAGGTGTTTTAAGAAGCGGCTC 

TGGGCCAGGCACAGTGGCTTATGCCTGTATTCCTAGCATTTTGGGAGGCCGAGGTGGGAGAATCACTTGA 

GCCGAAGAATTTGAGACCAGCCTGGGGAATATAGTGAGACCCTGTC^ 

TGGGTGTGTGGTGGTGCATGCCTGTAGGCCCAGCTACGCGGGAACATCACCTGAGCCCAGGAGGTTGAGG 
TTGCAGTGAGCTACAGTTGCGCCACTGCACTCCAGCCTAGGTG^ 

AAAACAGCTCTGGATGGGAAGGGAGGCCAGTTGCTTTAAGTAGGGGAGATAGAGTTAAAGGAGGCTTTGT 
TTTATTTAAAGGTGGGACAAACTTAAGCATGTTAAT 

AGCCATGTTTGGAAGAAAATGGGGTCCAGAGCAC^GGAAGGGGACCTGTGT^ 

TGAGGCCACAGGAAAGAATCTGTAGGTGGAGGGGAGGCCGAAGAGGGGAAGTTTCATGCTTGATAATTAA 
AATTTTCTGAGATAGGAATGTCATATTTACCTATTTAAGCCAAGTTTTTTTAGATAAAAGGTATGGAACC 
TGCTTTCCCCTTGGCTAGTTCAGCGTTTGGGCTCCGGAGTGCTGAAGATGAGGACTGGACTTCGAGCTGG 
TGTGATCCCAGTATTCAGTGTCAGTACTCAGTGACAAAATAAATGAGAGAAACGGGAATAAGAATTGTCG 
CCTACACAAAAATACCAGCAACTGTTAACT 

GAACCCTAAAAAGTCTTTGAAGCAACTCAAGTTTTAAAAAAGGGGAGGAACTCCTG 

GGGCCAAGATGTGGCTGGAGAGTGTGTGGTGATGGAGGGCGTGTCTTTTGCCGAGCACACTCAGGGCCCA 

CXSGGAAGCCCATAGACTTCAAGGACATG^GCCCCAAGGTGGTGGGATTTTCCCCACCAGTACT 

CCTAGGGGGAAGGGGAGGGCGGGAGAAGATAATGGGGATCCCTGGCTCCAAACATAGGAGGACACATCTG 

TGCTACAGTGCGCACATGCCTGGATGTACACTCTGTCTTTGGAGACACTGGCTAAGATTCTCTGCrCCAT 

GTTTGGACAGGGTCGTGCCTGATCTGAGATAAATGGACAAGAACAACTGAAGCCTGTCTTCTGGTGCATG 

TGTC^CCTGCCGATAACTGCATCTTGTGATAAAGOT 

TGGGGTCTTACGCCTTTATAACTCCATGGGCCCCAGCAAAGGTTCAGGCTCAAAACAGGTGTCAAATAGA 

TAACTGTTGAATGATTGTTCCCCAGTTGCAGGCTCTGCCACCTGGCGTTCATACTGTCTGTGAAAGGACC 

CAGCTCACCTTTCCCTCTTTATCTCCCAGTCCTTCCCAACAGCGCCGACACCTCATGGAAACTGATTGCA 

AATGTGCTACTTCTCACTTCTGTGTGGCCCGAGGAGGCTGGGTTAATGCTGGGCTTGGTACCTTAAGCAC 

CCTTTCTCCCTTCCCCATCTTCATTCTCAGAATTACACCTGTCTGAAGCAGGCATTTTCCAATGCCCTAG 

ATGGGAATATAAGTGTAAGGAGATGTGAAGCATTTGCCTGTGT 

AGGCACTTCTAGAAACCAAATCCTTGAAGATGACT 

GCAAAAACTGAAGCCAATTGAAATGTCCATCAGGAGGGGATTAAATGAATTATGGTACAGTTACA 

GAATATTTTACAGCCATTGAAGATGATATATAGCTATATTCATTGACAAGGAAAACTCATATTTTTTAGT 

GAAAAAAGCAGGTTATAGAATTGCATGATATTC^CA 

ATTGAATTGTTAATAACTATGGTCACCTCTAGAGATGGAAGTTTGCATTACCTTTAATTTTTAATACCAT 

TTTGTATTGCTTAAAATTTGTATGTATTAT^ 

GGGAAAAAAAAAAAAAAAAA 

Human SNX1 mRNA sequence - var5 (public gi: 4406620) (SEQ ID NO: 156) 

ATAAAAGGTATGGAACCTGCTTTCCCCTTGGCTA 

ACTGGACTTCGAGCTGGTGTGATCCCAGTATTCAGTGTCAGTACTCAGTGACAAAATAAATGAGAGAAAC 
GGGAATAAGAATTGTCGCCTACACAAAAATACCAGCAACTGTTAACTCTTCCCAGAAGATTTTCATTCTG 
AATGCTCCTGTAGCTAGGAACCCTAAAAAGTCTTTGAAGCAACTCAAGTTTTAAAAAAGGGGAGGAACTC 
CT^GAAATCTCAGGATGGGGCCAAGATGTGGCTGGAGAGTGTGTGGTGATGGAGGGCGTGTCTTTTGCCG 
AGCACACTCAGGGCCCACGGGAAGCCCATAGACTTCAAGGACATCAAGCCCCAAGGTGGTGGGATTTTCC 
CCACCAGTACTTGGCAGCCTAGGGGGAAGGGGAGGGCGGGAGAAGATAATGGGGATCCCTGGCTCCAAAC 
ATAGGAGGACACATCTGTGCTACAGTGCGCACATGCCTGGATGTACACTCTGTCTTTGGAGACACTGGCT 
AAGATTCTCTGCTCCATGTTTGGACAGGGTCGTGCCTGATCTGAGATAAATGGACAAGAACAACTGAAGC 
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CTGTCTTCTGGTGCATGTGTCAC CTG CCGATT^ACTGCATCTTGTGATAAAGTTGGGTGATTTACAGTCTC 

CACCAAATGCTAAACTCTGGGGTCTTACGCCTTTATAACTCCATGGGCCCCAGCAAA 

AACAGGTGTCAAATAGATAACTGTTGAATGATTGTTCCCCAGTTGCAGGCTCTGCCACCTGGCGTTC^ 

CTGTCTGTGAAAGGACCCAGCTC^CCTTTCC^ 
C^TGGAAACTGATTGCAAATGTGCTACTTCT^ 
- CTTGGTACCTTAAGCACCCTTTCTCCCTTCCCCATCTTCATTCTCAGAATTACACCT 

ATTTTCCAATGCCCTAGATGGGAATATAAGTGTAAGGAGATGTGAAGCAT 

TCACTGAGGATCCTCATAGGCACTTCTAGAAACCAAATCCTTGAAGATGACTAACCAGAAATGCCCGTCA 
T AGCACTGTTTACAGTTG CAAAAACTGAAGCCAATTGAAATGTC CATCAGGAGGGGATTAAATGAATTAT 
GGTACAGTTACACCGTTGAATATTTTACAGCCATTGAAGAT^ 

AACTCATATTTTTTAGTGAAAAAAGCAGGTTATAGAATTGCATGATATTCACATTTATATAAAACTTTAT 
ATATGGGAAGGATGTTGATTGAATTGTTAATAACTATGGTCACCTCTAGAGATGGAAGTTTGGATTACCT 
TTAATTTTTAATACCATTTTGTATTGCTTAAAATTTGTATGTATTATCGTTAAAATAAGAAAAATCAAAT 
AAAGCTATTTTCATTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^ 

Human SNX1 mRNA sequence - var6 (pubHc gi: 34535422) (SEQ ID NO: 157) 

TTTCCGCCGCGGGTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCG 
CCCTTCCCCGGCCTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCG 
ACACCGAGGGGGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCTCCAAAGATAACTAC 

ATCCCTTCTTCCGATCAACAATGGCTCCAAAGAAAATG 

GATCTCTTTGCAGATGCCACAGTGGAGCTATCCTTGGACAGCACACAAAATAATCAGAAGAAGGTGCTAG 
CCAAAACACTCATTTCTCTTCCTCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCCAACCTATGAGGA 
GCTAGAGGAAGAAGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATAGGG 
GATGGTATGAATGCATATGTAGCCTACAAAGTTACAACAC^ 

AGTTTGCAGTAAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCTCA 

GAATGGCTTCATTGTCCCTCCACCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAG 

GAAGATTCTTCTTCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTG 

Ti\AATCATCCTACCATGTTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGCCACGTGC 

CGTGGGTACCCAGACATTGAGTGGTGCTGGTCTCCT'CAAGATGTTCAACAAAGCCACAGATGCCGTCAGC 

AAAATGACCATCAAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGG 

AGCAGCGCTTACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGA 

CACAGCCCAGTTTGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTGTCACGGGCA 

CTCTCCCAGCTGGCTGAGGTGGAAGAAAAAATTGAGCAGCTC^ 
TCCTCCTTGCTGAGCTCCTGAGTGACTACATTC^ 

CATGAAGACATGGCAGCGCTGGCAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGG 

CTGCTGTGGGCCAACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCTCGGG 

TGACTCAATATGAAAGGGACTTCGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATACGGTTTGAGAA 

AGAGAAATCCAAGGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCTTTGCTCACAGCAGCAG 

GCTGGGGAGCAGTTGGGAATCAGGTCTGGAATACTCCTAACCAAGAAG 

TOTCTACCGTTCACAAGTTTTGTGCTGCTGCTTCC^ 

CAGCTACCTGTTCTGAGGGTCTCAATCTGTTTCGTATTCCCACTTCTTTAGGGAAGG^ 

TCTCTTAAAATAAGAGGAGCAAAATCTATTAAAACCT^ 

TCTCTTTTTTTTTTTTTTTTTGGTGTCCCT 

CCCTAAGCTGCTCAGGGCTCTCTGAGTCTTGCCCTCTGATGGCAAGTCTTATATATAACTAAACCTATTT 
TTGTCACCCATGAAAACACATCCTCAGTAGACTGTGTGAAGGTGTGAAGGTCTGATAATGACTTGATGCT 
TTATCTCCATAGAC^TGAAAGCC^TGCCCT 

GGAGGTTCAAAGAAGCTCTACTGTCTGTGCCCAGGAGGTAGCCTGCCAGCAAGAGCCCTG^ 

CACACAGCCAAAGGGTGTTCACACAGATCTCTGCCX5GTCTA 

AGGATGGGCTTCCCCCTTAGCTGTGTCGAC^GCTGCTC^ 

CCTTCGTGAGCTCGAGCTGCTGACTGCCACTATGGGAGCCT 

CGCTCCTGCCACTGGGGCCCCAGGTCCTGCTGTATCAGTTCTCTTTGGTGGG 

AGGCAACCTGAGACAAGAAAACGC7VGTAAAC7VTTCT 

GGCCAGTGGTGCAAGTATTTCTTTTTAAC^GGTGAAGTTTTTGGAAAAAGTCACTCTCCCTACCCCT 
TATCCTTACCATCAACTTTGGTTTTATCCTTCCAGT 

AATATACACATAAAGTATTTTGTATCCTGCTTTTATCATTC^CATTGTACATGTTATAAGCATTTTAOT 
ATATTGTTATATATCTTCACAAAGTTGATCTGTAAAGCTGTGTAATTTGAAGGCATCCATAGGGTGACT 
TACCATAATTTTGATTCATCCCTTGTTGTTGGATTCTTGGTCAGGGGTT 
ACTTTAAAATTTTGAATACAATTTCAGATTTACAGAAAAGTTGCAGGAATATCA 

ATCTTTTATCCAGATTTACTGAGTGTTTACATTTTATCCCATTTGCCTTATCTATATTTCATGTTGCATT 

TTCTTAATCATTTGAGAATAATTTGCACAGATACCCCATTATGCCCAAAACAGTATGCATTO 

ACAGGACATTCTCTTCTAAGAGAAGAAGAGAATTACTTTA^ 

TATCAAAATCAGGAAGTTTAAGAGTGATTTAATACTGTTATCTAACCCATGATTCATATTTAAATTTTGC 
CATTTATCCCAATAATGTCCTTTGTAGCCATTCTTTTACCTTGTGCAGGATC^TGTTACATTTGTAAACG 
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TGTGTCTCTC^TACTGCAGATTCCTCAACTTT^^ 
CAGGCTATTTTGTCG 

Human SNX1 raRNA sequence - var7 (public gi: 38197125) (SEQ ID NO: 158) 

GTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCX3GAGAGACTGCCTCCGCCCTTCCCCGGC 

CTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGGGG 

AGGACATTTTGACCGGCGCCGCGGTGGTCAGTAAAC^ 

CATCAACAATGGCTCCAAAGAAAATGGGATCGATGAAG 

GATGCCACAGTGGAGCTATCCTTGGACAGCACACAAAATAATC^^ 

TTTCTCTTCCTCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCC^^CCTATGAGGAGCTAGAGGAAGA 
AGAACAGGAGGATCAATTTGATTTGAC^GTCGGTATAACTGATCCTGAGAAGATAGGGGATGGTATGAAT 
GCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCAT^ 

AAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCTCAGAATGGCTTCAT 
TGTCCCTCCACCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAT^GTTGGGAAGGAAGATTCTTCT 
TCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTGTAAATCATCCTA 
CCATGTTACAGGACCCTGACGTC^GAGAGTTCTTGGAAAAAGAAGAGCTGCCACGTGCCGTGGGTACCCA 

GACATTGAGTGGTGCTGGTCTCCTCAAGATGTTCA^ 

AAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGCGCTTAC 

GGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAACAQ 

TGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTGTCACGGGCACTCTCCCAGCTG 

GCTGAGGTGGAAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGACTTCTTCCTCCTTGCTG 

AGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAGACATG 

GGAGCGCTGGCAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGGCTGCTGTGGGCC 

AACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCT 

AAAGGGACTTCGAGAGGATTTCAAGAGTGGTCCGAAAAGAAGTGATACGGTTTGAGAAAGAG 

GGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCTTTACTGACAGCAGCAGCTGGGAAAG 

^GGGAAGCCTTCCTTCCTGAGGCAAAGGCCATCTCCTAATGGACCAAGGACCCCAGAGCCCACCTGTGTG 

ACGCTGCCTTTTTATACACTGTCCTCCTCCACCTTGATGGACCCCTAGTGATGCATCCTGCCTAGGCTGG 

ACTTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACTCT 

TCCATATATATTTTCTTACCTAAGAGAATAGTTTCCTGCTTTAAGCAAAAGACCTACAATAGGTGGTGGA 
ATTATGGGATGGGGTGGAGTATTGATATAAATATATAAATACAAATGTATATTTTTC^GGATGTGGTTTA 
GGAACTGGGAATAACGTTTTCTGTTACTCCTGATGGTGCCATGAAAAGGTTATGTAATAAAATATTTTAA 

AATCAAAAAAAAAAAAAAAAAA 

Human SNX1 mRNA sequence - var8 (public gi: 23111033) (SEQ m NO: 159) 

GGGTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCG 

GCCTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGG 

GGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCTCCAAAGATAACTACATCCCTTCTT 

CCCATCAAC^TGGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCC^C^GGATCTCTTT^ 

CAGATGCCACAGTGGAGCTATCCTTGGACAGCACACAAAATAATCAGAAGAAGGTGCT 

CATTTCTCTTCCTCCTCT^GGAAGCCACAAATTCTTCGAAGCCCCAGCCAACCTATGAGGAGCTAGAGGAA 

GAAGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATAGGGGATGGTATGA 

ATGCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCAT^ 

AAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCTCAGAATGGCTTC 

ATTGTCCCTCCGCCCCCGGAGAAGAGCCTC^TAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTT 

CTTCTGC^GAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTGTAAATCATCC 

TACCATGTTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGCCACGTGCCGTGGGTA 

CAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTTCAAGAAAGCGACAGATGCC 

TCAAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCC^GGAGGTAGAGTGTGAGGAGCAGCGCTT 
ACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAACACAGCCCAG 
TTTGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAA^ 

TGGCTGAGGTGGAAGAAAAAATTGAGC^GCTCCACCAGGAACAGGCCAACAATGACTTCT^ 

TGAGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCG 

TGGCAGCGCTGGC^GGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGGCTGCTGTGGG 

CCAACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCTCGGGTGACTCAATA 

TGAAAGGGACTTCGAGAGGATTTCAACAGTGGTCCGAAAAGT^GTGATACGGTTTGAGAAAGAGAAATCC 

AAGGACTTC7^GAACCACX5TGATCAAGTACCTTGAGACACTCCTTTAOT 

ACTGGGAAGCCTTCCTTCCTGAGGCAAAGGCCATCTCCT^ 

TGACGCTGCCTTTTTATACACTGTCCTCCTCCACCTTGAT 

GGACTTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACTCTAACCGTTATOT 

CTTCCATATATATTTTCTTACCTAAGAGAATAGTTTCCTGCTTTAAGCAAAAGACCT 

GAATTATGGGATGGGGTGGAGTATTGATATAAATATATAAATACAAATGTATATTTTTCAGGATGTGGTT 
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TAGGAACTGGGAATAACGTTTTCTGTTACTCCTGATGGTGCCATGAAAAGGTTATGTAATAAAATATTTT 
AAAATCAAAAAAAAAAAAAAAAAA 

Hiunan SNX1 raKNA sequence - var9 (public gi: 23111035) (SEQ ID NO: 160) 

GGGTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCG 
GCCTGGAGCCGGAGTCCGAGGGGGCGGeCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGG 
RGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACAT^ 

CCCATCAACAATGGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTG 

CAGATGCCACAGTGGAGCTATCCTTGGACAGCACAC^AAATAM 

CATTTCTCTTCCTCCrrCAGGAAGCCACAAATTCTTCGAAGCCCCAGCCAA 

GAAGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATA 

ATGCATATGTAGCCTACAAAGTTACAACACAGAGA^ 

AAAT^GAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCTCAGAATGGCTTC 

ATTGTCCCTCCGCCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTT 

CTTCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGAOT^ 

TACCATGrTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCT 

CAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTTCAACAAAGC 

TC^GATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGCGCTT 
ACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAACACAGCCCAG 
TTTGC^AAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAACA^ 

TGGCTGAGGTGGAAGA7U\AAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGACTTCTTCCTCCTTGC 
TGAGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCTGGGAGTCTCGGGTGACTCAATATGAA 
AGGGACTTCGAGAGGATTTO\A<^GTGGTCCGAAAAGAAGT^^ 
ACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCT 

GGAAGCCrrCCTTCCTGAGGCAAAGGCCATCTCCTAATGGACCAAGGACCCCAGAGCCCACCTGTGTGAC 
GCTGCCTTTTTATACACTGTCCTCCTCCACCTTGATGGACCCCTAGTGATGCATCCTGCCTAGGCTGGAC 
TTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACTCTAACCGTTATTTCATTTAGCTTC 
C^ATATATATTTTCTTACCTAAGAGAATAGTTTCCTGCTTTAAGCAAAAGACCTACAATAGGTGGTGGAAT 
TATGGGATGGGGTGGAGTATTGATATAAATATATAAATAGAAATGTATATTTTTCAGGATGTGGTTTAGG 
AACTGGGAATAA.CGTTTTCTGTTACTCCTGATGGTGCCATGAAAAGGTTATGTAATAAAATATTTTAAAA 

TCAAAAAAAAAAAAAAAAAA 

Human SNX1 raKNA sequence - varlO (public gi: 23111031) (SEQ ID NO: 161) 

GGGTGGAAGAAGATGGCGTJCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCG 

GCCTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGG 

GGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCTCCAAAGATAACTACATCCCT 

CCCATCAACAATGGCTCCAJ\AGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTG 

CAGGGGATGGTATGAATGCATATGTAGCCTAC^y^GTTAC/^CACAGACAAGCTTACCATTGTTCAGAAG 

CAAACAGTTTGCAGTAAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCAC 

TCTCAGAATGGCTTCATTGTCCCTCCGCCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTG 

GGAAGGAAGATTCTTCTTCTGC^GAATTTCTTGAT^AAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAG 

GATTGTAAATCATCCTACGMGTTACAGGACCCT 

CGTGCCGTGGGTACCCAGACATTGAGTGGTGCTGGTCTCCT^ 

TCAGCAAAATGACCATCAAGATGAATGAATGAGACACT 

TGAGGAGC^GCGCTTACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCG 
CTGAACACAGCCCAGTTTGCAAAGAGTCTAGCG^ 

gggcactctcccagctggctgaggtggaagaaaaaattgagcagctccaccaggaac^ 
cttcttcctccttgctgagctcctgagtgactacattcgcctcctggccatagtccgcgctgccttcgac 
cagcgcatgaagacatggcag cgctggcaggatgcccaagccacactgcagaagaagcgggaggc cgagg 
ctcggctgctgtgggccaacaagcctgataagcto 

tcgggtgactcaatatgaaagggacttcgagaggatttcaacagtggtccgaaaagaa 

gagaaagagaaatccaaggacttcaagaaccacgtgatcaagtaccttgagacactcctttactcacag 

agcagotggc^^gtactgggaagccttcc^ 

cagagcccacctgtgtgacgctgcctttttatacact^ 

catcctgcctaggctggacttaacccct^cctccctgtccccacgaccaaotgt 

cgttatttcatttagcttcc^tatatatt^ 

ctacaataggtggtggaattatgggatggggtggagtattgatataaatatataaatac7u\atgtata^ 

tttcaggatgtggtttaggaactgggaataacgttc 

gtaataaaatattttaaaatcaaaaaaaaaaaaaaaaaa 



Human SNX1 protein sequence - varl (pubHc gi: 23111032) (SEQ ID NO: 281) 
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MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAWSKHQSPKITTSIiLPINN 
GSKENGIHEEQDQEPQDLFAGDGMNAYVAYKVTTQTSLPLFRSKQFAVKI^FSDFIiGLyEKLSEKHSQNG 

FIVPPPPEKSIiIGOTKVKVGKEDSSSAEFLEKRRAALERYM^ 

TQTIiSGAGLLKMFNKATDAVS KMTI KMNESDI WFEEKLQEVECEEQRLRKLHAVVETIjVNHRKEIjALNTA 
QFAKSLAMLGSSEDNTALSRALSQLAEVEEKIEQLHQEQ 

TWQRWQDAQATLQKKREAEARLLWANKPDKLQQAKDE I LEWES RVTQYERDFERI STWRKEVIRFEKEK 
SKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAIS 

Human SNX1 protein sequence - var2 (public gi: 23111036) (SEQ ID NO: 282) 

MASGGGGCSASERLPPPFPGIiEPESEGAAGGSEPEAGDSDTEGEDIFTGAAWSKHQSPKITTSLLPINN 

GSKENGIHEEQDQEPQDIiFADATVEIjSIiDSTQNNQKKVLAK^ 

DQFDLTVGITDPEKSGDGMNAYVAYKVTTQTC^ 

PPEKSLIGMTKVKVGKEDSSSAEFLEKRRAALERYLQRIVNHPTMI^ 

gagllkmfnkatdavskmtikmnesdiwfeek^ 

IJU^SSEDNTALSRAIiSQLAEVEEKIEQ 
RISTVVRKEVIRFEKEKSKDFKNHVIKYLETLL^^ 

Human SNX1 protein sequence - var3 (public gi: 12653179) (SEQ ID NO: 283) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAWSKHQSPKITTSIiLPINN 

gskengiheeqdqepqdlfadatvelsldstqnn^^ 

DQFDLTVGITDPEKIGDGmAYVAYKVTTQTSLPbFRSKQFAVKRRFSDFIiGLYEKLSEKHSQNGFIVPP 
PPEKSLIGMTKVKVGKEDSSSAEFLEKRRAALERYLQRIVNH^ 
GAGLLKMFNKATDAVSKMTIKMNESDIWFEEKLQETO 
IiAMIiGSSEDNTALSRAIiSQLAEVEEKIEQbHQEQANNDFFLLAELLSDY^ 

QDAQATLQKKREAEARLLWANKPDKIjQQAKDEI IiEWESRVTQYERDFERI STWRKEVIRFEKEKSKDFK 
NHVIKYLETLIiYSQQQLAKYWEAFIiPEAKAIS 

Human SNX1 protein sequence - var4 (public gi: 34535423) (SEQ ID NO: 284) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAWSKHQSPKITTSIiLPINN 
GSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQK^ 

DQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQFAVKRRFSDFLGIjYEKLS^ 

PPEKSIiIGOTKVKVGKEDSSSAEFLEKRRAALERYLQRIVNHPT^ 

GAGLLKMFNKATDAVSKMTIKMNESDIWFEEKLQEVE 

IiAMLGSSEDNTALSRAIjSQIjAEVEEKIEQLHQEQANNDFFLIiAELLSDYIRIiIA 

qdaqatlqkkreaearllwankpdklqqa 
nhvikyletllcsqqqageqlgirsgilltkklprysctfstv^ 

Human SNX1 protein sequence - var5 (public gi: 3152942) (SEQ ID NO: 285) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAWSKHQSPKITTSLIjPINN 

GSKENGIHEEQDQEPQDLFAGDGMNAWAYKVTTQTSLPLFRSKQFAVKRRFSDFIX5LYEKLSEKHSQNG 

FIVPPSPEKSLIGMTKVKVGKEDSSSAEFLEKRRAALERYL^ 

TQTLSGAGLLKMFNKATDAVSKMTIKMNESDIW 

QFAKSLAMLGSSEDNTALSRALSQIjAEVEEKIEQLHQEQAI^ 

TWQRWQDAQATIjQKKIIEAEARIjLWANKPDKLQ^ 

skdfknhvikyletllysqqqlakyweaflpeakai s 

Human SNX1 protein sequence - var6 (public gi: 3152940) (SEQ ID NO: 286) 

MASGGGGCS AS ERLP P P FPGLEPE SEGAAGGS E PE AGDSDTEGED I FTGAAWSKHQS PKI TTSLLP INN 

GSKENGI HEEQDQEPQDLFADATVEIiSLDSTQNNQKKVIiAKTLI SLS PQEATNSS KPQPTYEELEEEEQE 

IX2FDLTVGITDPEKIGDGMNAYVAYKVTTQTSL 

SPEKSLIGMTKVKVGKEDSSSAEFLEKRRAAIiERYLQRIVNHPTMI^ 

GAGLLKMFNKATDAVS KMTI KMNESDI WFEEKLQEVECEEQRLRKLHAVVETLVNHRK^ 

LAMLGSSEDNTALSRAIiSQLAEVEEKIEQLHQE 

QDAQATIiQKKREAEARLLWANKPDKLQQAKDE I LEWESRVTQYERDFERI STVVRKEVI RFEKEKS KDFK 
NHVI KYLETLLYSQQQLAKYWEAFLPE AKAI S 



Unigene Name; SNX3 Unigene ID: Hs. 12102 



Human SNX3 mRNA sequence - varl (public gi: 23111040) (SEQ ID NO: 162) 



Figure 36 part - 90 



IP C T/ U S O *W O 6 3 O 8 

CTGTCTGCGACCCCGAGTCCC^TG^ 

TCGGCCGGGCCTCCCGGGAGCCGGGCGTGGCGTTCCAGCTAGTGAGCCGTTTCTCCCCTGGGCTCGGAGG 

CGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGTGAACGCCCGCTCTACGTGCTCGTTCTCT 

TCGCGACCGCTGCGCGCGAGCCCCGTGTCCC^TOGCGGGCAGCAGCGGCGGCGGCGGCGGCTGAACGCG 

GAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCC 

GGCGGCTGATCACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACTTCCTCGAGATCGA 

TGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAATCAGGGTCAAGGTC 

GTAGTTCCCCCGOTCCCTGGGAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGATG 

ATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAAC^GGTCGCTGGTCATCCTCT 

GGCACAGAACGAACGTTGTCTTCACATGTTTTTAC^GATGAAATAATAGATAAAAGCTATACTCCATCT 

AAAATAAGACATGCCTGAAATTTGGCAAGAAGGGGCAAAAACGTGACTATTAATGATTGATAAGCACCAG 

TGAAGAAGTTCTAACTTTTAGCATGCTGCACAGAA 

TATGCTCAGTTTTGTTTTGTTTTGGCAGTTGACAAGAAGTTAATTTGCTTTAGTAAAAATCCCTCATTCC 
AGCCTTTCTATATAAATAGCTCTTTCTTGCTGTT 
ATTCCAGTGTAATCTGCAGTGTCGTAACTAAAGTTACT 
TGTTTGCTTCTTGCATCTGATTAACTAGAATATTTC 

CCCATTTATGTGTAGGAGCACTACACCATTGGTTTCCAATACTGCACACATAAGAT 

AGAAAGTATCTTCCT CCAGGCTTGTAATACC CTTCACATGGAAGATTAATGAGGGAAATCTTTATATTCT 
GTATAAAAACAAAAGCAAATTTATATACTAAAATCATCTGTCTAAAAATTTAAGTTGTTTTCAAATAA^ 
ATTAAAATGCATTTCTGATATGCAAAAAAAAAAAAAAAAAAAAAAAAT^AAAAAAAAA 

Human SNX3 mRN A sequence - var2 (pubHc gi: 34304375) (SEQ ID NO: 163) 

GTCCGGCCGGAACCTGTTTGCGACCCCGAGTCCCATGACACCGCTTCTCCTCACACCCCAGTCCGCAGTG 
CCCCTCCCCAGCCTCGGCCGGGCCTCCCGGGAGCCGGGCGTGGCGTTCCAGCTAGTGAGCCGTTTCTCCC 
CTGGGCTCGGAGGCGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGTGAACGCCCGCTCTAC 

GTGCTCGTTCTCTTCGCGACCGCTGCGCGCGAGCCCC^ 

GCGGCTGAACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACC 
GTGGCTGACACCCGGCGGCTGAT<^CCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGC^CT 
TCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAAT 
CAGGGTCAAGACAAATCTTCCTATTTTCAAGCTGAAAGAAT 

GAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGCCCTGCCTCAGAATGACATCAGAGGCAAGGAGTC 
ATGGAAGGACGTGGTGTGCTCAGAATGATGAAAAGTTATTTTGTGACTAGAAAGTCGTAGTTCCCCCGCT 
. CCCTGGG7VAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGATGATGGAATATTTGATGACAATTTTATT 
GAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTGGTCATCCTCTGGCACAGAACGAAC 
GTTGTCTTCACATGTTTTTACAAGATGAAATAATAGATAAAAGC^ 

CTGAAATTTGGCAAGAAGGGGCAAAAACGTGACTATTAATGATTGATAAGCACCAGTGAAGAA 
CTTTTAGCATGCTGCACAGAAACTGGTATAACATGCCTTC^^ 

AATAGCTCTTTCTTGCTGTTTTAATGTGGTGCACACTATAGCCTCAGAAACCTGTTATTCCAGTGTAATC 
TGCAGTGTCGTAACTAAAGTTACTGGCTTGGTCTTATTTGCACAGTTTTTGCGTCTTGTTTGCTTCTTGC 
ATCTGATTAACTAGAATATTTCTCTTTCCCCCTTTT^ 

GGAGCACTACACCATTGGTTTCCAATACTGCACACATAAGATACATACTTGTGTGCAGAAAGTATCTTCC 
TCCAGGCTTGTAATACCCTTC^CATGGAAGATTAATGAGGGAAATCTTTATATTCTGTATAAAAACAAAA 
GCAAATTTATATACTAAAATCATTTGTCTAAAAATTTAAGTTGTTTTCAAATAAAAATTAAAATGCATTT 
CTGATATGCAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 mKNA sequence - var3 (public gi: 34190889) (SEQ ID NO: 164) 

TTOACCCACGCGTCCGCCC^CGCGTCCGCTGTTTGCGACCCCGAGTCCCATGACACCGCTTCTCCTCACA 

CCCCAGTCCGCAGTGCCCCTCCCCAGCCTCGGCO^CCTCCCGGGAGCCGGGCGTGGCX3TTCCAGCTAG 

TGAGCCGTTTCTCCCCTGGGCTCGGAGGCGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGT 

GAACGCCCGCTCTACGTGCTCGTTCTCTTCGCGACCGCTGCGCGCGAGCCCCGTGTCCCCACGGCGGGCA 

GCAGCGGCGGCGGCGGCGGCTGAACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGC^GCAGCTACAGC 

GAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCACCAAGCCGCAGAACCTGAAT^ 

GACCCCCGAGGAACTTCCTCGAGATCGATC 

CACCACTTACGAAATCAGGGTCAAGACAAATCTTCCTATTTTCAAGC^ 

AGATA<^GTGACTTTGAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGCCCTGCCraVGAATGACAT 

(^GAGGCAAGGAGTCATGGAAGGAOSTGGTGTGCTCAGAATGATGAAAAGTTATTTTGT^ 

CGTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCAGCTT 

GATGAGAATTTTATTGAGGAAAGAAAACAAGGG CTGGAGCAGTTTATAAACAAGGTCGCTGGTCATC CTC 
TGGCAC^GAACGAACGTTGTCTT(^CATGTTTTTACAAGATGAAATAATAGATAAAAGCTATACTCCATC 
TAAAATAAGACATGCCTGAAATTTGGGAAGAAGGGGCAAAAA 
GTGAAGAAGTTCTAACTTTTAGC^TGCTGCAC^GAAACTGGTATAACATGCCT 
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AATTAAAATGCATTTCTGATATGCAAAAAAA7VAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var4 (public gi: 15779011) (SEQ ID NO: 165) 

GGGGCTTCGCGACCX3CTGCGCGCX5AGCCCCGTGTCCCCACGGCGGGCAGCAGCGGCGGCGGCGGCGGCTG 
AACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTG 

ACACCCGGCGGCTGATCACCAAGCCGCAGAACCTGA 

GATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACC^ 

AAGACAAATCTTCCTATTTTCAAGCTGAAAGAATCTACTGTTAGAAGAAGATACAGTGACT^ 

TGCGAAGTGAATTAGAAAGAGAGAGCAAGGTCGTAGTTCCCCCGCTCCCTGGGAA^ 

GCTTCCTTTTAGAGGAGATGATGGAATATTTGATGACAATTTTATTGAGGAAAGAAAAC^ 

CAGTTTATAAAC^GGTCGCTGGTCATCCTCTGGCACAGAACGAACGTTGTOTTCACA 

ATGAAATAATAGATAAAAGCTATACTCCATCTAAAATAAGACATGCCTGAAATTTGGCAAGAAGGGGCAA 

AAACGTGACTATTAATGATTGATAAGCACCAGTGAAGAAGTTCTAACTTTTAGCATGCTGCACAGAAACT 

GGTATAACATGCCTTCAGTATACTAACACTCATATGCT 

GTTAATTTGCTTTAGTAAAAATCCCTCATTCCAGCCTTTCTATATAAATAGCTCTTTCTTGCTGTTTTAA 

TGTGGTGCACACTATAGCCTCACA7ACCTGTTATTCCAGTGTAATCTGCAGTGTCGTAACTAAAGTTACT 

GGCTTGGTCTTATTTGCACAGTTTTTGCGTCTTGTTTGCTTCTTGCATCTGATTAACTAGAATATTTCTC 

TTTCCCCCTTTTAATTTGTGATGTCACTTGACCCCATTTATGTGTAGGAGCACTACACdATTGGTTTCCA 

ATAOTGCACACATAAGATACATACTTGTGTGCAGAAAGTATCTTCCTCCAGGCTTGTAATACCCTTCAC7V 

TGGAAGATTAATGAGGGAAATCTTTATATTCTGTATAAAAACAAAAGCAA 

TGTCTAAAAATTTAAGTTGTTTTCAAATAAAAA^ 



Human SNX3 mRNA sequence - var5 (public gi: 15929496) (SEQ ID NO: 166) , 

CGCGCGAGCCCCGTGTCCCCACGGCGGGCAGCAGCGGCGGCGGCGGCGGCTGAACGCGGAGGGGGCGGAG 
GGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCA 

CCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCA 
GCAAACGGTGGGGGTCGGCCGGGGCCGCTTC^CCACTTACGAAATC^ 

TTCAAGCTGAAAGAATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAATTAGAAA 
GAGAGAGCAAGGTCGTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGA 
TGATGGAATATTTGATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTC 
GCTGGTCATCCTCTGGmCAGAACGAA.CGTTGTCTTCACATGTTTTTACAAGATGAAATAATAGATAAAA 

GCTATACTCCATCTAAAATAAGACATGCCTGAAATTTGG 
TTGATAAGC AC (^GTGAAGAAGTTCTAACTTTTAGCATG CTGCACA 

TATACTAACACTCATATGCTCAGTTTTGTTTTGTTTTGGCAGTTGACAAGAAGTTAATTTGCTTTAGTAA 

AAATCCCTCATTCCAGCCTTTCTATATAAATAGCTCTTTCTTGCTGTTTTAATGTGGTGCACACTATAGC 

CTCACAAACCTGTTATTCCAGTGTAATCTGCAGTGTCGTAACTAAAGTTACTGGCTTGGTCTTATTTGCA 

CAGTTTTTGCGTCTTGTTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCTTTCCCCC 

TGATGTCACTTGACCCCATTTATGTGTAGGAGCACT^ 

ACATACTTGTGTGCAGAAAGTATCTTCCTCCAGGCTTGTAATACCCTTCACATGGAAGATTAATGAGG 
AATCTTTATATTCTGTATAAAAACAAAAGCAAATTTATATACTAAAATCATTTGTCTAAAAATTTAAGTT 
GTTTTCAAATAAAAATTAAAATG CATTTCTGATATGCAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var6 (public gi: 14250078) (SEQ ID NO: 167) 

AGCCCCGTGTCCCCACGGCGGGCAGCAGCGGCGGCGGCGGCGGCTGAACGCGGAGGGGGCGGAGGGAGCC 
CGCGGCGGCGGCAGCAGOTACAGCGAAATGGCGGAGACCGTGGCTGACACC 
CGCAGAACCTGAATGACGCCTAC^GACCCCCC^GCA^ 
GGTGGGGGTCGGCCGGGGCCGCTTCACGACTTACC^^ 

CTGAAAGAATCTACTdTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAATTAGAAAGAGAGA 
GCAAGGTCGTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGATGATGG 

AATATTTGATGACAATTOTATTGAGGAAAGAAAACAAGG 
CATCCTCTGGCACAGAACGAACGTTGTCTT 

CTCCATCTAAAATAAGACATGCCTGAAATTTGGCAAGAAGGGGCAAAAACGTGACTATTAATGATTGATA 
AGC^CCAGTGAAGAAGTTCTAACTTTTAGCATGCTGC^CAGAAACTGGTATAACATGCCTTCAGTATACT 




iCTTTAGTAAAAATCC 
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CTCATTCCAGCCTTTCTATATAAA^ 
AACCTGTTATTO^GTGTAATCTGCA^^^ 

TTGCGTCTTGTTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCTTTCCCCCTTTTAATTTGTGATG 

CACTTGACCCCZATTTATGTGTAGGAGCACTACACCATTGGTTTCCAATACTGCACACATAAGATACATAC 

TTGTGTGC^GAAAGTATCTTCCTCC^GGCTTGTAATACCCTTCACATGGAAGATTAATGAGGGAAATCTT 

TATATTCTGTATAAAAACAAAAGCAAATTTATATACT^ 

AAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

> Human SNX3 raRNA sequence - var7 (pubHc gi: 12957159) (SEQ ID NO: 168) 

GGGCGAGGAGGGAGCCCGCGGCGGCGGGAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGG 

CGGCTGATCACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACTTCCTCGAGATCGATG 

TGAGCAACCTCCAAACGGTGGGGGTCGGCCGGGGCCGCTTC^CCACTTACGAAATC^GGGTC^GGTCGT 

AGTTCCCCCX3CTCCCTGGGAAAGCGTTTTTGCGTCAGCTTCCTTCT 

GACAATTTTATTGAGGAAAGAA^CAAGGGCTGGAGCAGTTTATAA^ 

CACAGAACGAACGTTGTCTTCACATGTTTTTACAAGATGAAATAA^ 

AATAAGACATGCCTGAAATTTGGC^^GAAGGGGCAAA^CGTGACTATTAATGATTGATAAGCACCAGTG 

AAGAAGTTCTiyVCTTTTAGCATGCTGCACAGAAACTC 

TGCTCAGTTTTGTTTTGTTTTGGCAGTTGACAAGAAGT^ 

CCTTTCTATATAAATAGCTCTTTCTTGCTGTTXTAATGTGGTGCACACTATAGCCTCACAAACCTGTTAT 
TCCAGTGTAATCTGCAGTGTCGTAACTAAAGTTACTGGCTTGGTCTTATTTGCACAGTTTTTGCGTCTTG 
TTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCTTTCCCCCTTTTAATTTGTGATGTCACTTGACCC 
(^TTTATGTGTAGGAGC^CTA(^CCATTGGTTTCCAATACTGCACACATAAGATACATACTTGTGTGC^G 
AAAGTATCTTCCTCCAGGCTTGTAATACCCTTCACATGGAAGATTAATGAGGGAAATCTTTATATTCTGT 
ATAAAAACAJ^GCAAATTTATATACTAAAATC^^ 

TAAAATGCATTTCTGATATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var8 (public gi: 34304374) (SEQ ID NO: 169) 

GTCCGGCCGGAACCTGTTTGCGACCCCGAGTCCCATGACACCGCTTCTCCTCACACCCCAGTCCGCAGTG 

CCCCTCCCCAGCCTCGGCCGGGCCTCCCGGGAGCCGGGCGTGGCGTTCCAGCTAGTGAGCCGTTTCTCCC 

CTGGGCTCGGAGGCGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGTGAACGCCCGCTCTAC 

GTGCTCGTTCTCTTCGCGACCGCTGCGCGCGAGCCCCGTGTCCCCACGGCGGGCAGCAGCGGCGGCGGCG 

GCGGCTGAACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACC 

GIK^CTGACACCCGGCGGCTGATC^CCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACT 

TCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAAT 

CAGGGTCAAGACAAATCTTCOTATTTTCAAGCTGAAAGAATCTACTGTTAGAAGAAGATACAGTGA 

GAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGGTCGTAGTTCCCCCGCTCCCTGGGAAAGGGTTTT 

TGCGTC^GCTTCCTTTTAGAGGAGATGATGGAATATTXGATGACAATTTTATTGAGGAAAGAAA 

GCTGGAGCAGTTTATAAACAAGGTCGCTGGTC^TCCTCTGGCACAGAACGAACGTTGTCTTCACATGTTT 

TTACAAGATGAAATAATAGATAAAAGCTATACTCC^TCT^ 

GGGGCAAAAACGTGACTATTAATGATTGATAAGCACCAGTGAAGAAGTTCTAACTTTTAGCATGCTGCAC 

ACAAGAAGTTAATTTGCTTTAGTAAAAATCCCTCATTCCAGCCTTTCTATATAAATAGCTCTTTCTTC 

GTTTTAATGTGGTGCACACTATAGCCTCACAAACCTGTTATTCCAGTGTAATCTGCAGTGTCGTAACTAA 

AGTTACTGGCTTGGTCTTATTTGCACAGTTTTTGCGTC 

ATTTCTCTTTCCCCCTTTTAATTTGTGATGTCACTTGACCCCATTTATGTGTAGGAGCACTACACCATTG 
GTTTCCAATACTGCACACATAAGATACATACTTGTC 

CTTCACATGGAAGATTAATGAGGGAAATCTTTATATTCTGTATAAAAACA?^AAGCAAATTTATATACTAA 

AATCATTTGTCTAAAAATTTAAGTTGTTTTCAAATAAJ^ 

AAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var9 (public gi: 3 0 5 8 3 0 6 6) (SEQ ID NO: 170) 

ATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCACC^ 

CCCCCAGCAACTTCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCAC 
CACTTACGAAATCAGGGTCAAGACAAATCTTCCTATTTTCAAGCTGAAAGAATCTACTGTTAGAAGAAGA 
TACAGTGACTTTGAATGGCTGTOAAGTGAATTAGAAAGAGAGAGC^ 

GGAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGATGATGGAATATTTGATGACAATTTTATTGA 

AAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTGGTCATCCTCTGG 

CTTCACATGTTTTTACAAGATGAAATAATAGATAAAAGCTATA 

Human SNX3 mRNA sequence - varlO (public gi: 3127052) (SEQ ID NO: 171) 

GGGCGAGGAGGGAGCCCGCGGCGGCGG(^GC^GCTACAGCGAAATGGCGGAGACC 
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CGGCTGATCACCAAGCCGC^GAACCTGAATGACGCCT^ 

TGAGC^UVCCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAATCAGGGTCAAGACAA^ 

TCTTCCTATTTTCAAGCTGAAAGAATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGT 

GAAOTAGAAAGAGAGAGCAAGGTCGTAGTTCCCCCGCTCCCTGGGA 

TTAGAGGAGATGATGGAATATTTGATGACAATTTT^^ 

AAAC^GGTCGCTGGT<^TCCTCTGGCACAGAACGAACGCT^ 

ATAGATAAAAGCTATACTCCATCTAAAATAAGACATGCCTGAA 

CTATTAATGATTGATAAGCACCAGTGAAGAAGTTCTAACTTTTAGCATGCTO 

ATGCCTTCAGTATACTAACACTCATATGCTCAGTTTTGTTTTC 

GCTTTAGTAAAAATCCCTC^TTCCAGCCTTTCTATATAAATAGCTCTTTCTTGCTGTTT^ 

ACACTATAGCCTCACAAACOTGTTATTCCAGTGTAATCTGCAGTGTCGTAACTA 

CTTATTTGCACAGTTTTTGCX5TCTTGTTTGCTTCTTGCATCTGATTAAOT 

TTTTAATTTGTGATGTCACTTGACCCC^TTTATGTGTAGGAGCACTACACC^TTGGTTTCCAATACTGCA 
CACATAAGATAC^TACTTGTGTGCAGAAAGTATCTTCCT 

TAATGAGGGAAATCTTTATATTCTGTATAAAAACAAAAGCAAATTTATATACT 

AATTTAAGTTGTTTTCAAATAAAAATTAAAATGCATTTCTGATATO 

AAAAAAAAAAA 

Human SNX3 mRNA sequence - varl 1 (public gi: 3126978) (SEQ ID NO: 172) 

GCGGCACAGCTAC^GCGAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCACCAAGCCGCAGAAC 
CTGAATGACGCCTACGGACCCCCCAGCAACTTCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGG 
TCGGCCGGGGCCGCTTC^CCACTTACGAAATCAGGGTCAAGACAAATCTTCCTATTTTCAAGCTGAAAGA 
ATCTACTGTTAGAAGAAGATACAGTGACrTTGAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGGTC 
GTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCACTTCCCTTTTAGAGGAGATGATGGAATATTTG 
ATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATA7^CAAGGTCGCT(3GTCATCCTCT 
GGCACAGAACGAACGTTGTCTTCACATGTTTTTACAAGATGAAATAATAGATAAAAGCTATACTCCATCT 

AAAATAAGA^TGCCTGAAATTTGGCAAGAAGGGG 
GTGAAGAAGTTCTAACTTTTAGCATGCTGGACAGAAACT 

CATATGCTCAGTTTTGTTTTGTTTTGGCAGTTGACAAGAAGTTAATTTGCTTTAGTAAAAA^ 

CCAGCCTTTCTATATAAATAGCTCCTTCCTGCTGTTTTAATGTGGGTGCACACTATAGCCTCACAACCT 

GTTAATCCAGTGTAATCTGCAGTGTCGTAACTAAAGTACTGGCTTGGTCCTAATTG 

Human SNX3 protein sequence - varl (public gi: 23111041) (SEQ ID NO: 287) 

MAETVADTRRLI TKPQNLNDAYGPPSNFLE IDVSNPQTVGVGRGRFTTYEI RVKVVVP PLPGKAFLRQLP 
FRGDDGIFDDNFIEERKQGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA 

Human SNX3 protein sequence - var2 (public gi: 23111043) (SEQ ID NO: 288) 

MAETVADTRRLI TKPQNLNDAYGPPSNFLE IDVSNPQTVGVGRGRFTTYEIRVKTNLP I FKLKE STVRRR 
YSDFEWLRSELERESKPCLRMTSEARSHGRTWCAQNDEKLFCD 

Human SNX3 protein sequence - var3 (public gi: 15779012) (SEQ ID NO: 289) 

MAETVADTRRLI TKPQNLNDAYGPPSNFLE I DVSNPQTVGVGRGRFTTYE I RVKTNLP I FKLKESTVRRR 
YSDFEWLRSELERESKVWPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQGLEQFINKVAGHPLAQNERC 

LHMFLQDEI IDKSYTPSKIRHA 

Human SNX3 protein sequence - var4 (public gi: 3126979) (SEQ ID NO: 290) 

MAETVADTRRLI TKPQNLNDAYGP P SNFLE I DVSNPQTVGVGRGRFTTYE I RVKTNLP I FKLKESTVRRR 
YSDFEWLRSELERESKWVPPLPGKAFLRHFPFRGDDGIFDDNFIEERKQGLEQFINKVAGHPLAQNERC 

LHMFLQDEI IDKSYTPSKIRHA 

Human STSJX3 pray sequence - varl (SEQ ID NO: 173) 

GCCGCCATGGNAGTACCCATACGACGTACCAGATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCAC 

CCAAGCAGTGGTATCAAOSCAGAGTGGCC^TTATGGCGGCGGCGGCGGCGGCTGAACGCGGAGGGGGCGG 

AGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGAT 

CACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACTTCCT 

CCGCAAACGGTGGGGGTCGGCCX3GGGCCGCTTCACCACTTACGAAA 

TTTTCAAGCTGAAAGAATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAA 
AAGAGAGAGCAAGGTCGTAGTTCX^CCNNGCTCCCTGGGAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGG 
AGATGATGGAATATTTGATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAG 
GTCGCTGGTCATCCTCTGGCACAAAACGAAC^TTGTCTTCACATGTTTTTACA^ 

AAAGCTNTACTCCATCTAAAATAAAACATGCCTGAANTTTGGCANAANGGGCNAAAACGTGACTATTATG 
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ATTGANAGCCCCNNNNAAAANTT 
CCTNNNTNCaSTANTTTNN^ 

Humaii S"NX3 pray sequence - var2 (SEQ ID NO: 174) 

. CCGCCATGGTAGTACCCATACGACGTACCAGTATTACGCTCATATGGCCATGGCAGGCCAGTGAATTCCA 
" CCCAAGGAGTGGTATCAACGCAGAGTGGCCATTATGKjC^ 
GGAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGGC^ 
-"^ TGACGCCTACGGACCCCCC^GCAACTTCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGC 

CGGGGCCGCTTCACCACTTACGAAATCAGGGT 

CTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGG 

TCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCAGCT^ 

AATTTTATTGAGGAAAGAAAACAAGGGCTGGANC^ 

GTGTANGACTNCTTTAAGTGACTACTTTTNTTTANATGTNAANNN 

CNTTTCCTANNTTTNATTTNTTTAA 

Unigene Name; SRA1 Unigene ID; Hs . 32587 Clone ID; 3GD_19 
Huniau SRA1 mRNA sequence - varl (public gi: 10436964) (SEQ ID NO: 175) 

ACGTGAAGCCGGGTGAGCGCAGCCGGCGGGCTAGGGCACTAGGTCGTCGCCCCGGCCTAGGCTGGGGGCG 
GTTGCGGCGCTTAGTATGGACCCTCTGTCTCCCCCAGCCCCAGTATAAGCTAACAGTGGAGTTCCGGGCT 
CGCTTCACAGATCCCTCGCCTCCGCAGGCAACAAGGAA 
CGGGCTGC^GACCCAGGCCGGCGGACCC^GGCGCTC 

GGATCCCCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCTCCACCTCCTTCAA 

GTAAGGTTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCACAAGTTTCCCAGT 

CGAGTCTGAGGCTCGACTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGACTGCCGTGG 

C^C^CAAGGAAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGC^GGAACAGTGGGCTGGAG 

SAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCCACCGGTGGGA 

CGCAGCAGATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTAGGA 

GTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAGAAATCTGCAG 

CCACAGCTGAGAAGAACCATACCATACCAGGCTTCCA 

CCGGACACCATCTCCTATGCCTTGGAGACCTTCTG^ 

CCCACTGGGCCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGGAGTTCATGTGTTACTCA 

TAAC^TGCATTTCAATAAAAACATCTCTGCGGTGGGCCTTGGGTAGGAGAGATGAACCCTTCCGGTGCCA 

AGCTAGTCCCCTCTGGTGTCCTCGACTGCCCTGCTCCCTGTGTATCTGCAAACCTCTGTTCTCCCTTCTC 

CATTGATC^GGAAGGGATCTGCTGGGTAAAGTCAGACTACT^ 

AAGCACATCCTGTGCTGGGCGGAGCAGCTGTGTTTGGATC 

TAGAACGGGGAGACTTTTCCCTCAACAAAAGGAAAGACAGTCCT^ 

CTACTGTTACAGAGATTAGAACCACATTGAGTGGGGTTTTCTGTGTAAATCGAAGGAGAAAAAGACCAGA 
TTACTGAGATTGGGGATTGTAACTCTGACTTGCC^ 

Human SRA1 mRNA sequence - var2 (pubHc gi: 9930611) (SEQ ID NO: 176) 

TCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGCC 

GCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATCCTCCACGGATTCCGGCAACCCGC 

CTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGC 

CAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGC 

CGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTATCAAGAGGGTCGC 

CGCACCCCAGGATGGATCCCCGAGAGTCCCC^ 

CCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCT^^ 

CAAGTTTCCCAGTCGAGTCTGAGGCTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGA 

CTGCCGTGGCCACACAAGGAAGC^GGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAG 

TGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCC 

ACCGGTGGGACGCAGCAGATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTG 

GATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGA 

AAATCTGCAGCCACAGCTGAGAAGAACGATACCATACCAGGCT^ 

CCCAGACT 

Human SRA1 mRNA sequence - var3 (public gi: 9930613) (SEQ ID NO: 177) 

TCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGCC 
GCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATCCTCCACGGATTCC^ 
CTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTG 
CAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGC 



Figure 36 part - 95 



P C TV" U S 01 »4/ O 6 3 O B 

CGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTAGC 

CGCACCCCAGGATGGATCCCCCAGAGTCCCCGCATCAGAG^ 

CCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCA^ 

CAAGTTTCCC^^TCGAGTCTGAGGCTCGACTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGA 
AGACTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGACATCAGCCGACG 

CAGTGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTC^ 

GCCACCGGTGGGACGCAGCAGATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCA 

GTGGATGGTAGGAGTTAAAAGATTAATTGC^GAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAA 

GAGAAATCTGCAGCCACAGCTGAGAAGAACCATACCATACCAGGCTTC 

TTCCCCAGACT 

Human SRA1 mRNA sequence - var4 (public gi: 4588026) (SEQID NO: 178) 

CGCTTGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGCCGCAGTTCTCAT 
ACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTO 

TGGATCCCCCAGAGTCCCCGCATCAGAGACTTCrCCTGGGCCTCCCCCAATGGGGCCTCC^CCTCCTTCA 
AGTAAGGCTCCC^GGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCACMGOTTCCCM 
TCGAGTCTGAGGCTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGACTGCCGTGGCCA 
CACAAGGAAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAGTGGGCTGGAGGA 
AAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGA 

C^GCAGATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTAGGAGT 
TAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTT CAGAGGAGGCAGC CAATGAAGAGAAATCTGCAGCC 
AGAGCTGAGAAGAACCATACCATACCAGGCTTCCAGCAGGCT 

GGACACCATCTCCTATGCCTTGGAGACCTTCTGTCACTTGGGTCCCT^CTTACCACCACCAAGACTGTCC 
CACTGGGCCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGGAGTTCATGTGTTACTCATA 
ACATGCATTTCAATAAAAACATCTCTGCGGTGGTG 

Human SRA1 rnRNA sequence - var5 (public gi: 25123254) (SEQID NO: 179) 

GGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACCCCGCCGCAGTTCTCATACGGGCT 

GCAGACCCAGGCCGGCGGACCC^GGCGCTCGCTGCTTACCAAGAGGGTCGCCGCACCCCAGGATGGATCC 

CCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCTCCACCTCCTTCAAGTAAGG 

CTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCACAAGTTTCCCAGTCGAGTC 

TGAGGCTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGACTGCCGTGGCCACAC^ 

AAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAGTGGGCTGGAGGAAAGTTGT 

CAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGC^AGAGCTTTCAAGCCACCGGTGGGACGCAGCAGA 

TGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTAGGAGTTAAAAGA 

TTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAGAAATCTGCAGCCACAGCTG 

AGAAGAACCATACCATACCAGGCTTCCAGCAGGCTTCATAATC 

ATCTCCTATGCCTTGGAGACCTTCTGTC^CTTGGCTCCCTTCTTACC^CCACCAAGACTGTCCC^CTGGG 
CCTGACCC^CCTATGAGGGAAGAAGTCCCACCTGGGCGAGAGGGAGTTCATGTGTTACTCATAACATGCA 
TTTCAATAAAAACATCTCTGCGGTGAAAAAAAAAAAAAl^A 

Hiunau SRA1 rnRNA sequence - var6 (public gi: 18027813) (SEQ ID NO: 180) 

GCAGGCACTAAGCTGGGCACTGGGAATGTAATAAAATAGTCA 

GGGAAACGAACAAGAGTCAAATAAGGCAGAAGATGTGATGTAATACACCTACGAAATCTCAGAGGGTTGT 

AGGGTCGTGGGAGCTGAAGTGAGACACTTAACCTGGCCTGAGACATTCCAGAAGGCCTCCTGAAGAACTG 

ACATCTGAACTGAGAACTGAAGGAAGATGAGTACTAGTGAGGCTACCGGACGTGAATGTGGAGATTGTGC 

AGGGCAATGCAAGAGGAGGCTGTAGAAGTCAACCTGGCTAGATCACAGCGGGGTGTATGTGGGGCAGGAG 

CTTCTTTGTTTGAATTTGCTCCTGAGAGGATGAGGCCTCCTAGAGCACTGGCTCCTGGACAGCAACCTCC 

TTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGCCGCA 

GGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATTCTCGACGGATTCCGG 

GCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGCCAA 

GCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGCCGC 

AGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCX^ 

ACCCCAGGATGGATCCCCCAGAGTCCCCGC^TCAGAGACTO 

CCTCCTTCAAGTAAGGCTCCGAGGTCCCCACCTC 

GTTTCCC^GTCGAGTOTGAGGCTCGACTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGA 
CTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAG 
TGGGCK^AGGAAAGTTGTCAATACCTGTAAAG 

ACCGGTGGGACGCAGCAGATGACATCCACCXSCTCCCTCATGGTTGACCATGTGACTGAGGTCAG^ 
GATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTC^GAGGAGGCAGCaiATGAAGAG 
AAATCTGCAGCCAGAGCTGAGAAGAACCATACCATACCAGGCTTCGAGCAGGCTTCATAATCCTCG 
CCCAGACTCACCGGACACCATCTCCTATGCCTTGGAGACCTTCTGTCACTTGGCTCCCrTCTT 
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CCAAGACTGTCCCACTGGGCCTGACCCACCTATGAGGGAAGL?^GTCCCACCTGGGCCAGAGGGAGTTCAT 
GTGTTACTCATAACATGCATTTCAATAAAAACATCTCTGCGGTGGAAAAAAAAAAAAAAAAAAA 

Human SRA1 mRNA sequence - var7 (pubHcgi: i6549596)(SEQlDNO: 181) 

TTATAGCAAAATCAGTGOVAATAAAAATCCCTCAGTGACCTCACTGGATGTGAGTATATTGGG^ 

CAGGGCTGGGGGCTAACACCGTGTGTGAGATGAGTGTCTTTGTGTCTGTGCTTGATGTTGGTGGCTOT 

GTAGTCACATGACAGCATGGGTGTGATGGAGATCTGACTTCA^ 

CCTGTGC CAGGCACTAAG CTGGGCACTGGGAATGTAATAAAATAGTCAA.GGT CCCACCTTCTAAGACTGT 
CCGACAGGGAAACGAACAAGAGTCAAATAAGGCAGAAGATGTGATGTAATACACCTACGAAATCTCAGAG 
GGTTGTAGGGTCGTGGGAGCTCAAGTGAGACACTTAACCTGGCCTGAGACATTCCAGAAGGCCTCCTGAA 
GAACTGACATCTGAACTGAGAACTGAAGGAAGATGAGTACTAGTGAGGCTACCGGACGTGAATGTGGAGA 
TTGTGCAGGGCAATGCAAGAGGAGGCTGTAGAAGTCAACCTGGCTAGATCACAGC 

CAGGAGCTTCTTTGTTTGAATTTGCTCCTGAGAGGATGAGGCOTCCTAGAGCACTGGCTCCTGGACAGC^ 
ACCTCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGT^ 
GCCGCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATTCTCCACGGATTCCGGCAACC 
CGCCTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCT 
GGCCAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACC 
CGCCGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGT 
AGCCGCACCCCAGGATGGATCCCCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGG 
CCTCCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGC 
CCACAAGTTTCCCAGTCGAGTCTGAGGCTCGACTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATT 
GGAAGACTGCCGTGGCCACACAAGGAAGCAGGTATGTGATO 

GAACAGTGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTT 
CAAGCCACCGGTGGGACGCAGCAGATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAG 
TCAGTGGATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAAT 
GAAGAGAAATCTGCAGCCACAGCTGAGAAGAACCATACCATACCAGGCTTCCAGCAGGCTTCATAATCCT 
CGGTTCCCCAGACTCACCGGACACCATCCCC^^ 

# CCACC^CCAAGACTGTCCCACTGGGCCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGGA 
GTTCATGTGTTACTCATAACATGCATTTCAATAAAAACATCTCTGCGGTGGGCCTTGGGTAGGAGAGATG 
AACCCTTCCGGTGCCAAGCrAGTCCCCTCTGGTGTC 
TCTGTTCTCCCTTCTCCATTCATCAGGAAGGGAT 

TCCCAAAGTAGACTGAAAGCACATCCTGTGCTGGGCGGAGCAGCTGTGTTTGGATGGTTTCATTTCAGCA 

TGAGAACAGACTCAAATAGAACGGGGAGACTTTTCCCTCAACAAAAGG71AAGACAGTCCTATTTGCACT 

TATCACCCTTGAGATACTACTGTTACAGAGATTAGAACC 

Human SRA1 mRNA sequence - var8 (public gi: 9930609) (SEQ ID NO: 182) 

TCCTTTGGTGCCTTGTGACGkGGGCCCTGATGGTTCATTAGA 

GCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATCCTCCACGGATTCCGGCAACCCGC 
CTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGC 
CAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGC 
CGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTAGC 
CGCACCCCAGGATGGATCCCCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCT 
CCACCTCCTTCAAGTAAGGCTCCGAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCA 
CAAGTTTCCCAGTCGAGTCTGAGGCTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAG^ 

ctgccgtggccacacaaggaagcaggtatgtgatgacatcagccgacgcctggc^ctgctgcaggaacag 
tgggctggagga7vagttgtcaatacctgtaaagaagagaatggctctactggtgcaagagctttcaagcc 
accggtgggaotcagcagatgacatccaccgc^ 

gatggtaggagttaaaagattaattgcagaaaagaggagtctgttttcagaggaggcagccaatgaagag 
aaatctgcagccacagctgagaagaaccataccataccaggcttco\gcaggcttcataatcctcggttc 
cccagact 

Human SRA1 protein sequence - varl (public gi: 9930610) (SEQ ID NO: 291) 

OTRCPAGQAEVEMAELYVKPGNKERG^ 

G P PPMGPPPPSSKAPRS PPVGSGPASGVE PTS FPVESEAVMEDVLRPLEQALEDCRGHTRKQVCIDDI SRR 
LALLQEQWAGGKLS I PVKKRMAIJLVQELSSHRWDAADDIHRSLMVDHVTEVSQW^WGVKRLIAEKRSLPS 
EEAANEEKSAATAEKNHTI PGFQQAS 

Human SRA1 protein sequence - var2 (public gi: 25123255) (SEQ ID NO: 292) 

MGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEAVMEDVLRPLEQALED 

QEQWAGGKIiS I PVKKRMALLVQEIiSSHRWDAADDI HRSIiIWDHVTEVSQWMVGVKRLI AEKRSLFS EEAA 
NEEKSAATAEKNHTI PGFQQAS 



Figure 36 part - 97 



IP C T / U S O O B 3 O 8 

Human SRA1 protein sequence - var3 (public gi: 9930614) (SEQ ID NO: 293) 

MTRCPAGQAEVEMAELYVKPGNKERGWNDPPQFS YGLQTQAGGPRRSLLTKRVAAPQDGS PRVPASETS P 
GPPPMGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEARLMEDVLRPLEQALEDCRGHTRKQVCDDISR 
RLALLQEQWAGGKIiSIFVKKRMALIiVQI^ 
v SEEAANEEKSAATAEKNHTI PGFQQAS 

Human SRA1 protein sequence - var4 (public gi: 9930612) (SEQ ID NO: 294) ■ 

^MTRCPAGQAEVEMAELYVKPGNKERGV^PPQFSYGI^^ 

GPPPMGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEAVMEDVXiRPLEQALEDCRGHTRKQVCT 

LALLQEQWAGGKLS I PVKKRMALLVQELSSHRWDAADDIHRSLMVD^ 
E E AANEEKS AATAEKNHTI PGFQQAS 



Unigene Name : SYNE 1 Uni gene ID : Hs - 416719 Clone ID : 3GD i _138aa2938 
Human SYNE1 mRNA sequence - varl (public gi: 21753084) (SEQ ID NO: 183) 

GTACAAAAACGAACTTTCACAAAATGGATCAACTCTCATCTGGC 

ACGATCTTTTTGAAGACATGAAAGATGGTGTTAAACTGCTTGCCC1TCTGGAGGTCCTGTCTGGGCAGAA 

ACTGCCTTGTGAACAAGGACGCCGGATGAAGCGAATCCATGCTGTGGCTAACATTGGCACGGCACTCAAG 

TTCCTCGAAGGAAGAAAGATTAAATTAGTCAA<^^ 

TAGTTCTTGGATTGATGTGGACCATTACT 

GCTCCAGTCTTTGTCCAGCAGCGCATCCTCra^ 

AGTAAACGGAAGGTGACCACCAAGATCCAAGGAAATGCTAAGAAGGCTTTATTAAAGTGGGTTCAGTACA 
CAGCTGGCAAGCAGACTGGAATAGAAGTAAAAGATTTTGGGAAGAGTTGGAGAAGCGGGGTTGCCTTTCA 
TTCAGTTATTCATGCCATTCGACCGGAATTGGTGG^ 

AATTTGGAGGATGCTTTCACTATCGCTGAAACAGAACTGGGGATCCCAAGACTGCTAGATCCTGAAGA 

TTGATGTGGATAAACCAGATGAGAAATCTATTATGACCTATGTAGCCCAGTTTCTGAAAC^TTATCCTGA 

CATCCACAATGCAAGCACTGATGGGCAAGAGGATGA 

TCTGTACAAAATTTTAAGAGAGAAGACAGAGTAATTTTTAAGGAAATGAAAGTT^ 
AGAGAGATTTGACAAGAGCACAGATGGTGGAATCAAATTTACAGGATAAATATCAG 
CAGAGTTCAATATGAAATGAAGAGGAAACAGATTGAACATTTAATACAACCATTACACAGAGAC 
TTGTGACTTGACCAAGCATTGGTAAAACAATCTTGGGATAG 

TAGAGCTTGATAAATCTCTTCCTGCACCTCTGGGCACCATAGGTGCCTGGOTGTACA^ 
CCTGAGAGAGGAAATAACCGTTCAACAGGTCCACGAGGAAACAGCAAACACGATACAACGGAAACTTGAG 

CAACATAAGGATCTGCTTCAAAACTVCGGATGCCC^^ 

CTGTTAACGGGATTCCAGTGCCACCTGATCAATTAGAGGACATGGCCGAGAGGTTTC^ 

C^CATCAGAGCTACACCTAATGAAAATGGAATTTTTAGAATTAAAGTACCGTCTGCTCT 

CTTGCAGAGTCAAAGCTGAAGTCTTGGATCATTAAGTACGGGAGG 

AAAACTACGTGTCTTTTATAGAAAATAGCAAGTTCTTTGAACAATATGAGGTGACATACCAGATCTTGAA 

ACAGACAGCTGAGATGTATGTCAAAGCAGATGGTTCAGTGGAAGAAGCTGAGAATGTGATGAAATTCATG 

AATGAAACCACCGCTCAGTGGAGGAATCTCTCAGTAGAAGTGAGGAGTGTGAGGAGCATGCTGGAAGAAG 

TGATCTCTAACTGGGATCGCTATGGCAATACAGTGGCTAGTCT 

AATGCTCAATCAATCAGAAAATGCCAAAAAGGATTTTTTTCGAAATTTA 

ACTGCCATGAACGATGCTGGCAATTTTCTAATTGAAACCT 

AGCAATTACTGTTGCTAAATGGGCGGTGGAGGGAGTTGTTTATC 

TGAGATGGACAGAATGAAGAAGGAATACACAGACTGTGTTGTTACCCTGTCTGCTTTTGCAACGGAAGCC 

CATAAGAAACTTTCTGAACCCTTAGAAGTCTCCT 

ATATTGAGCAGAGGGTGCCTGTGATGGATGCCGAATAG^ 

CAAAGAAAGCCCCC 

Human SYNE1 mRNA sequence - var2 (public gi: 22382201) (SEQ ID NO: 184) 

AGCGGCTGCCTCCTTGTTGAGTGCTGCAAAGGCCTC 
CGAAGCATGTTTTCTAGAGTGGTGTAGCCCTC 

TGCTTCCGGAGGGACCATGGCAACCTCCAGAGGGGCCTCCCGGTGTCCTCGGGATATCGCCAATGTGATG 

CAGAGGCTGC^AGATGAGCAAGAGATAGTACAAAAACGAACTTTCACAA^ 

CCAAGCGGAAACCTCCAATGGTGGTGGACGATCTTTTTGAAGAC 

CCTTCTGGAGGTCCTGTCTGGGGAGAAACTGCCTTC 

GTGGCTAAGATTGGCACGGCACTCAAGTTCCTC 

CCGATATAGCTGATGGCCGACCCrrCAATAGTTCTTGGATTGATGTGGACCATTATTCTATA 
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TGAAGAGTTGACCAGCAACCTGCCCCA 

GTTAGCTCTGAGACTCC CAGC CAACCAAGTAAACGGAAGGTGACCACCAAGAT CCAAGGAAATGCTAAGA 
AGGCTTTATTAAAGTGGGTTCAGTACACAGCTGGGAAGCAGACTGGAATAGAAGTAAAAGAT^ 
GAGTTGGAGAAG CGGGGTTGC CTTTC^TTCAGTTATTCATGCCATTCGACCGGAATTGGTGGACTTGGAG 
ACAGTGAAAGGCAGATCCAACCGAGAAAATTTGGAGGATGCnTT^ 

TCCCAAGACTGCTAGATCCTGAAGACGTTGATGTCK3ATAAACCAGATGAGAAATCTATTATGAO 

AGCCCAGTTTCTGAAACATTATCCTGACATCCACAATGCAAGCACTGATGGGCAAGAGGA^ 

CTTCCAGGTTTCCCATCTTTTGCAAATTCTGTACAAAATTTTAAGAGAGAAGACA 

AAATGAAAGTTTGGATAGAACAATTTGAGAGAGATTTGACAAGAGCACAGATGGTGGAATCAAATTTA 

GGATAAATATCAGTCATTTAAGCACTTCAGAGTTCAATATGAAATGAAGAGGAAACAGATTGAAC^TTTA 

ATACAACCATTACACAGAGACGGTAAATTGTCACTTGAC 
TGACCTCCAGGCTCTTTGACTGGCATATACAGCTTO 

TG CCTGG CTGTACAGAGCGG AGGTGGC CCTGAGAGAGGAAATAACCGTT CAACAGGTCCACGAGGAAACA 
GCAAACACGATACAACGGAAACTTGAGCAACATAAGGATCTGCTTCAAAACA 

CATTCCATGAAATCTACCGGACCAGGTCTGTTAACGGGATTCCAGTGCCACCTGATCAATTAGAGGACAT 
GGCCGAGAGGTTTCATTTTGTTTCCTCCACATCAGAGCTACACCTAATGAAAATGGAATTTTTAGAATTA 
AAGTACCGTCTGCTATCACTGCTGTTCTTGCAGAGTCAAAGCTGAAGTCTTGGATCATTAAGTACGGGAG 
GAGAGAGTCAGTGGAGC^GCTTCTACAAAACTACGTGTCTTTTATAGAAAATAGCAAGTTCTTTGAACAA 
TATGAGGTGACATACCAGATCTTGAAACAGACAGCTGAGATGTATGTCAAAGCAGATGGTTCAGTGGAAG 
AAGCTGAGAATGTGATGAAATTCATGAATGAAACCACCGCTCAGTGGAGGAATCTCTCAGTAGAAGTGAG 
GAGTGTGAGGAGCATGCTGGAAGAAGTGATCTCTAACTGGGATCGCTATGGCAATACAGTGGCTAGTCTG 
CAAGCCTGGCTAGAGGATGCTGAAAAAATGCTCAATCAATCAGAAAATGCCAAAAAGGATTTTTTTCGAA 
ATTTACCTCATTGGATTCAGCAGCATACTGCCATGAACGATC 

TGAGATGGTTTCCCGTGACCTGAAGCAGCAATTACTGTTGCTAAATGGGCGGTGGAGGGAGTTGTTTATG 
GAAGTCAAGCAATATGCTCAAGCTGATGAGATGGACAGAATC 

CCCTGTCTGCTTTTGCAACGGAAGCCCATAAGAAACTTTCTGAACCCTTAGAAGTCTCTTTTATGAATGT 

CAAGCTATTAATTCAAGACTTGGAGGTGAGGGGTTTCTGAATCAAAATGAAAAGCCT 

AGGAAAGATCAGCAAGTTTATTCAGATCATTGCAAAAGCTGTTTCTGTGTCTCCTGGGCATCATTT^ 

ATGTCTGTATGTCCCAATTTGCACCTGTCAGAAAT^AATGTATTGAACATAAAAAAGACATGACTTGATC 

ATATAAAGTAACTTCAAATTGTTAAAAAAAAAAAAAAGAAAAAAAAAAAAA 

Human SYNE1 mRNA sequence - var3 (public gi: 2819262?) (SEQ ID NO: 185) 

AGTACGCGGGAGTCTTAAAACGGAAGAAGAAAAAGAAGCAGTTCAGTCTT^ 

TGAGTGCTGCAAAGGCCTGGAATTCATTTATGACAGAATAGATCTAGAAAAG 

AGTGGTGTAGCCCTGTGCTGCCTCCAGTGAAGAGTCTCTTGGTG 

TGGCAACCTCCAGAGGGGCCTCCCGGTGTCCTCGGGATATCGCCAATGTGATGCAGAGGCTGCAAGATGA 

GCAAGAGATAGTACAAAAACXSAACTTTCACAAAATGGATCAACTCTC^TCTGGCCAAGCGGAAACCTC^ 

ATGGTGGTGGACGATCTTTTTGAAGACATGAAAGATGGTGTTAAACTGCTTGCCCTTCTGGAGGTCCTGT 

CTGGGCAGAAACTGCCTTGTGAACAAGGACGCCGGATGAAGCGAATCCATGCTGTGGCTAACATTGGCAC 

GGCACTCAAGTTCCTCGAAGGAAGAAAGATTAAATTAGTCAACATTAACTCCACCGATATAGCTGACGGC 

CGACCCTCAATAGTTCTTGGATTGATGTGGACCATTATTCTATATTTCCAGATTGAAGAGTTGACCAGCA 

ACCTGCCCCAGCTCCAGTCTTTGTCCAGCAGCGCATCCTCCGTGGACAGCATAGTTAGCTCTGAGACT 

CAGCCCACCAAGTAAACGGAAGGTGACCACCAAGATCCAAGGAAATGCTAAGAAGGCTTTATTAAAGTGG 

GTTCAGTACACAGCTGGCAAGCAGACTGGAATAGAAGTAAAAGATTTTGGGAAGAGTTGGAGAAGCGGGG 

TTGCCTTTCATTCAGTTATTCATGCCATTCGAC^ 

CAACCGAGAAAATTTGGAGGATGCTTTCACTATCGCCGAAACAGAACTGGGGATCCCAAGACTGCTAGAT 
CCTGAAGACGTTGATGTGGATAAACCAGATGAGAAATCTATTATGACCTATGTAGCCCAGTTTCTGAAAC 
ATTATCCTGACATCCACAATGCAAGCACTGATGGGG^ 
TTTTGCAAATTCTGTACAAAATTTTAAGAGAGAAGAC^ 

GAACAATTTGAGAGAGATTTGACAAGAGCACAGATGGTGGAATCAAATTTACAGGATAAATATCAGTCAT 
TTAAGCACTTCAGAGTTCAATATGAAATGAAGAG 

AGACGGTAAATTGTCACTTGACCAAGCATTGGTAAAACAATCTTGGGATAGAGTGACCTCCAGGCTCTT^ 
GACTGGCATATACAGCTTGATAAATCTCTTCCT^ 

CGGAGGTGGCCCTGAGAGAGGAAATAACCGTTCAACAGGTCCACGAGGAAA(^GC^ACACGATACAACG 
GAAACTTGAGCAACATAAG 

Human SYNE1 mRNA sequence - var4 (public gi: 21734187) (SEQ ID NO: 186) 

GGGACACAGTGAGAAACCACTGTATGAAGTGTTTGGGGCTCAGA 

GCAGGGGCAACAAAGCCGGAGTCCTGGGAGAGACTT03GGAAAGAAACCATGGAGCAGAGT 
GAATCGCAGTTGGT(^GGTGGGCTCGGAAGTGCTTAGAGGAAGAAAGGAGGGGAAGCCTCCCTGCTTTGT 
GATTGGCCTGGTGCTTTTGATCA1TGGCAGTTTGTCTGGAATGACAGGCATGGAATGGACAAAGTGGAGA 
AGAGCCTGGCTGTGAAGCAGCCTGTTCACCATGGAAAAGAGCTGTGGCTCCTATCCTGAAGGTCGTGGAG 
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CCACAGCAGGATCTGCGGAGGGAGGTGCTGGGATCCTCCCTCCTO^GGATGTGCAGATO^ATATTGT 

ATCTTTCTGGATACCACAGGGAGAAGGGCATATTCGGCGGAGAGAGACCAAATGAAACCTTTTACAA^ 

CAGAGAGAAGTAGGGTGGTGGCCTAAACTAGGGGAAGCAGAATTGGGAATGGGGAGAATGGGAATGATGT 

GAGAAAT GAGATAGAGAAGACTCCTCCAGAACTCTCAGTCC^^ CGATTTTCTGG 
GCTGGGCATCTTGGTGAAAGATGCAGGTGGTCCTAGGCCCTGAGGACCACAAGAGGGAAGGAGCACTGTG 

GGTGCAGGTGGGCAAGGGAGGTGGGGCTGTGAGAGCAG^ 
TCTTGAGATACCTGCAGAATATCCy\AATGCAAAAGTC^ 

CAGAAATGCAGATTGGGTAGGTATTCACATGTAAATGGGAATGGTCCTGGAGTGAACGGAGGAGCT 
CAGGAAGAGTGTGTGAAGGAAAACAAGAAGGACCACCACCCAAGCCACACATGCAGTGAAGGGATC 
GAGAAACAGAAACTCTGTAAGGAAGGTGAATAAAAATAGAATAAAGAGTTGGAGGCTGATTTGTGGCACT 
TGGAAATGTATCTCATACATTCTGTCAAAGGAC^TCTGGGGAATTTCTGTTTGGTTCTGGTGGTTCACAT 

CAGATTCCCAAGGGATGACACTGTTCTAAAAAGAAAATGAOT 

AAGGCCTATTAGTCAGGCATATGGCATCTGAAGCAGAGCTGTCCAAAACCAGCCACTGGCCAGTTGGGAC 
TGTTGAGCTCTGAGATGGGACTGTGCAAATTGAGATGGGTTGTGCGTGGAAAACATGCTTACATGAATTT 
CAAAGACTTAGTACAAGAAAGAAAATAAAATATTAATAATTATATTGATTACATGTTATAATCCCTGTCT 
AATGTAGTGTTAAAATTAATTTTATAAGTTTCTTTTTACATTTCTAATGTGGCTACGAAACCTTTAAGAT 
TACATATATAGTTCACATAGAAATATATGGGACAGCGCTGCTCTGGAGTCTGGGCTGAAATCTCAGTTCT 
GCC^TGTACTTTCTGTTTAAACTTAGATAAGGAACCTAATTCCTCTGTGCCTCAGTTTTCTCATCTATAA 
AATGGGAATAACATTCCCAGGTACCCTATAGGGTTTCTATGTGATAAATTTGTGCTCAGACCAGAGCCTG 
GCTCATAAAAACACTCTCAGTCACTGTGAGTTCTTTTTTTTTTTTTTTTTTTTTTT 

AGTCTAGTTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAACCTCCGCCTCCCCG 

GTTCAAGCGATCCTCCTGCCTCAGCCTCCCATGTAGCTGGGATTACAGGTACCTGTCACCACACCTdGCT 

AATTTTTGTATTTTTAGTAGAGATGGGGTTCC^CCATGTCGGCCAGGCTGGTCTCGAATTCCTGACCTCA 

GGTGATCCACCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCCACCCACTG 

TGAATTCTTATGATTCATTTCAGGAAAGCTTTGGTGAGCCTGCACGCTCCTCTGTGCGCTCAGGAGCTAT 

GTGTCTAGAATAACTGACTTCCTTTTTTTTCCCCTAGGAAAGTTATTTTCCTGCACAAGGGATTCAGGGT 

TTCCAGAACTTAGCTGTCAACTTAGACTGTGCTTTTTTTGCAGATGTAATGATCCCCGAGAGCCCTGAGG 

CCTATGTAAAACTCACAGAAAATGCAATCAAAAATACCT 

CCTCCCTCCCCCGTAGAGCAATCCrCCTCT^ 

AGCATGTTAAGGCACAGCCTTCCTCTCTTACTGCTGTACTAGAAAAAACAGCTGGTTAAATCCACACCGA 

GAATAAGATTTCACTAATCGAGCGAAATAAAATAACTTCTCAACTTGTTAAATGGTGATTGGTCTCATTA 

GGTATAGACCTCTCATGTCCATTAACTGCAGAAAATATGAGAAGGAAAACCCAGTCATCAGCCTCTGCGG 

CCTAGTGTTCTACGTGGTGTGGTAATTTCAGCTTCACTGCATGCAGACCTACCTGTG 

GGTGCAGGCTCTGGTCCCAAGTCCGCCAGCCTGCATGAGTGACCTTTGGCCCATC 

CTCATCTG^GAATCCCGTATGAGACAAGGGGTGAGATCAGATTTCAGCTCTAAAAAATATATGTAATTT 
TAATTTAAGAGGTCGTAAAAGATAATTTGAAATGAAAAATGTATTTCACGGTATGCTGAGCCATAGATAA 
GAACAGAACTATTCCTGAAACAAGAAGAATTAAAGAAAGAAAAATGGAACTAGTTTTGCTTAGTGTTGTG 
' TTAGACAAACTGTAGTGCAGGAGTTCAGGATCAGTGATGTGGGATGTCGGCGGGGAATAGAGTTTGAACG 
CAGTGATATGATATTGAATCACGGAGTTACTAGTTTACGCTTCAGTTTTTGAAGAAAATCAAAAGGACAG 
AAAGCAAAGTAACATTACTGAGAGGGTGATTCCCAGGGAGGGACCTCTCOTAGGTGTATCTAGAAGGCCT 
TTTTTTAGAAACAAATAAAAAGATTTAATAAAGCTTACTAATATTTGTTCTGCTTTCAC 
TTCACTGATGATCAAATGTTCCTGTGTAGTTCGAAGACTTTC 

TCAGTAATTTTACTU^GAAATGTTACAACTTTGAGAGGAGAATGAGCCAGAATTCTAGGCTATGAGTAA^ 

AACCTGCCTAGATGGAAATGTTAAAATCTTAGCTTTCTCCTGGTTTTGTTTCAGATCTTAGATAAAAAGC 

AAGTCGTTGOTAGTTTGATATCTCTGTATATATCTATTCTGAGGCACTCTTTTCTTGATTA 

TGCCTTCAGTAAATGATGCAGCAACCTGAGCCTTCCGTGACACTATCTTCCCCTGAGGTGCATGAAGAAA 

AATC^GAGGGAGGATCTTCCCCTGCTCACTAAGCGATAGCAGAAAGAACATGAGAAAAAGAACAGCTTTC 

T CCTTACTGAGATGCAGTAGACACCATTGCAGATTTTTAGAAAGGCCTTCC ACACTGACAACATT CAGAT 

GATCAGGGGTTCAGCTGGAAGAGGGTCAGGTGCACGAAGCTGTTCyVAAACGGACTGGAGAGCCGTOT 

GACGTCCGATCTGCTAGGGCTCTCAGTCAAGC^CCCT 

GGCACGATGGTACTAAATTTGCTTTGTAATTACCC^CACCTGCCATTTCTATGCTGCTGTAACTGAAATC 
ATTTCTGAAACTTCTCCTTGAATATCAAGCTTTAAATAAGTCAAATAGTTTGTCCAGTAAAGATTCT 
GGTTGCCACCGCAGGGGACCACAGTGCCCTAGAGTCACAGATCCGACAACTGGGCAAAGCCCTGGATGAT 
AGCCGCTTTCAGATACAGCAAACCGAAAATATC^TTCGCA^ 

CCAGCTACAAAGGCTACATGAAACTGCTGGGCGAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGA 

GCACAAACTGAAGGAGGAAGAGGAGAGCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAAAC^ 

GCTGGTGTGATTGACCGATGGGAGCTTCTCCAGGCCCAGGCATTGAGCLAAGGAGTTGAGGATGAAGCAGA 

ACCTCCAGAAGTGGCAGCAGTTTAAOTCAGACTTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGA 

GGAGTTGGAAC^GCTCCAGCGTCTGGAACTCAGCACTGACATCCAGAC 

CTCAAGGAGCTCCAGAAAGCTGTGGACCACCGCAAAGCCATCATCCTCTCCATCAAT 

AGTTCACCCGGGCTGACAGCAAGGAGAGCCGGGACCTGCAGGATCGC 

GGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGT 
TTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGGAGAACATTGACAGAAGGAAAAATGAAATTGTCC 



Figure 36 part - 100 



P C TV U S O H*S O B 3 O & 

rTATT(^TTCTAACCTTGATGCAGAGATACTTCAGGACCATCACAAACAGCTTATGGTAAGATGTGTGAA 

GTTGTTTCTTTTTTTTTAACTTTTTGCATTATAGTCTCTACTTTA 

^TCTTTGTAAAGCAAACATGTC^ 
TTGCTGAGTTGTAAAAATCTTAGTGTTCT 

^TAATTTAAACTTCATGTTAC^TCT^^ 

ATATTAGAATAGAATGAGACAATTAAGTCTTTTGTTTGTTTTTGTTGTTOTTGTTTTTTGAGA^ 

.^^TC^CTCTGTTCC^^ 

GAGCAATTCTCCTGCCAATTCTCCTGCCTCACCCTC 
GCCTCGCTAATGTTTGTATTTTTAGTAGAGACAGGGTCT 
TGGCCGCAGGTGATCCACCTGCCTGAGCCTCCCAAAGTGCT 
GCCGAGTCTTTCAAAGAGGAATTAAATACATCAGATTAAACATGAA 

CCAAGACAAAATGGGATVACAAGGAGTAAACTTACTTTCATTATCTGG " 

CAAGGAAGGAGAAACTTTTTCTAGCACTAAATTCAAGAGGAAATTAACTGGTAGACTCTTATACAAGGAT 

CTTTGGACAATATAATGTAC^GTATATTTAAGTGACTTTATAGAAGATAAGGAAGCATATTTGAGTTCCA 

TTAGAAGAAAATATTATGCACTTTGTAGCTCTCTGTATTTTTAAAATGTTATGTCTTAACATTTAACACT 

CACCTAAACTACAGAATTGGTACCTTTTAATTCAGTACCATAATAGTCTTAGAAACCTAGAGGAAATAGC 

TGTGGAACTGCATTTTTACTTGACTTTGACCTCT 

TTTTCAAATCTTGACTAGATATCAGAGGATACCTAGACATACTTCTGCTTCGCTATATTTAATGTTGTGC 
TTTTCTGTTTAAAAGATTATCTTACATCTCACT'TGCATACTAATCTATATTTTAATTACTGTCATATATA 
CATTAACTAATTTGAACCTTCCAATAATACTGTGGACCAGGCATCAAATCAAACTGAGATCAGAGACGGT 
CAGGGGTCTTATAGAATATTTTTGGCAGAGGCAGGATTAGAACTCAGGCCGTGAGCTGCTGCATCCTTTA 
GTGTGTGAGCTCCACGTTTGATGCTCAGGTATAATTTCCCACCAAGTTAAGTTGATTGCATTCTGCACTT 

TTGGAGCTTTTGCCAATTATCAAAAATGCTTAGAAA^ 

CATGAGCTGTTGGAATCCCAACTCAGAGTAGCCTCTTTGCAAGACATGTCTTGCCAACTACTGGTGAATG 
CTGAAGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTCCATGTTA 

GAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAGTTATTAGACGTGTCAAGTAGTCAGCAGGATTTG 

TCTTCCTGGTCTTCTGCTGATGAACTGGACACCTCAGGGTCTGTGAGTCCCACATCAGGAAGGAGCACCC 

CAAACAGACAGAAAACGCCACGAGGCAAGTGTAGTCTCTCACAGCCCGGACCCTCTGTCAGCAGTCCACA 

TAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGAGCCAGGGCCAGGTCGGTCCGGCCGCGGC 

TTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTTCTCCTGCTCCTCCTCATCGGGCTTGCCT 

GCCTTGTACCGATGTCAGAGGAAGACTACAGCTGTGCCCTCTCCAACAACTTTGCCCGGTCATTCCACCC 

CATGCTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAAGCAGATGCCATCTGCAGAAGTGCTGGTA 

GCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTACCAACAAGAGGACCTTGATCTTGGCGAAAGC 

TCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAATTATGGCTTCAGAGGTGGAAGATAA 

ACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATC 

CACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAATTCTGGGCCCTTGGCCCATTCT 
GTGCACAGCCAAGGACTTCAGTAGACCATCTO 

ATGACCCTCCCAAGCACCATGTCAGTGTCGTACAATCTACCAACCAACCAGTGCTGAAGAGATTTTAGAA 
CCTTGTAACATACAATTTTTAAGAGCTTATATGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGGCATG 
ATGTTTTAACCTTTGCTTTAGAAGCACAAGCTGTAAATCTAAAAOT 

AAAACTAGACGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGTAAAAA?^ 
AAAGATATTTATGTATGTACAGTTTGCTAAAGCCAAGTTTTGTTTGTATTGATTTCTTTGCATTTATTAT 
AGATATTATAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SYNE1 mRNA sequence - var5 (public gi: 21734305) (SEQ ID NO: 187) 

CACTGGC^CGACCGCTCTGC^GAC^GCCT^ 

CTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGTG 

GGATCACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCTGAGGATGAA 

GAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGAGCTGTTGCCTTATCAGATGTAATGATCCCCGAGA 

GCCCTGAGGCCTATGTAAAACTCAC2AGAAAATGCAATCAAAAATACCT 

GTCACAGATCCGACAACTGGGCATU^GCCCTGGATGATAGCCGCTTTCAGA 

ATTCGCAGCAAAACTCCCACGGGGCCXX3AGCT 

AATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAAOTGAAGGAGGA^ 

TGGCTTTGTTAACCTGCATAGTACCGTW^CCCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCAG 
GCCCAGGCATTGAGG^GGAGTTGAGGATGAAGCAGAACCTC^ 

TGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGGAACTCAG 

CACTGACATCCAGACCATCX3AGCTCCAGATCAAAAAGCTCAAGGAGCTCCAGAAAGCTO 

AAAGCC7VTCATCCTCTCCATCAATCTCTGCAGCCCTGAGTTCACCCAGGCTGACA 

ACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCG 
GGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAA 

CTGGAGAACATTGACAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGC^GAGATACTTC 
AGGACCATCACAAACAGCTTATGCAAATAAAGCATGAGCTC 
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GCAAGACATGTCTTGCCAACTACTGGTGAATGCTC 
CATGTTATTGGAAATCGKCTCAAACTTCT 
TATTAGACGTGTCAAGTAGTCAGCAGGATTTGTCTTCCT^ 
GTCTGTGAGTCCCACATC^GGAAGGAGCACCCCAAACAGA<^GAAAACGCCACG 

TCACAGCCTGGACCCTCTGTCAGCAGTCCACATAG^ 

CTGAGCCAGGGCCAGGTCGGTCCGGCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCA 

GCTTCTCCrGCrCCTCCTCATCGGGCTTGCCTGCCTTGTACC^TGTCAGAOT 

CTCTCCAACAACTTTGCCCGGTCATTCCACCCCATGCTCAGATACACGAATGGCCCTCCTCCACT 

CTAAGCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTAC 

CAACAAGAGGACCTTGATCTTGGCGAAAGCCATCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGT 

GTGCAAATTATGGCTT(^GAGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAAC^GAAGGTTTG 

GAAGAAATCTGGTTTGAGACTCTGAACCTTAGCACTAAGGAGATTC 

CLACTGATGAATTCTGGGCCCTTGGCCCATTCT 

TTTCCCATGGTGCTGCTCCAACCATCAGATAAATGACCCTCCCAAGCACC^ 

ACCAACCAACCAGTGCTGAAGAGATTTTAGAACCTTGTAAC^TACAATTTTTAAG 

TTCCTTTTTACCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTTTGCTTTAGAAGCACAAGOT 

CTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTTA 

TGTGAAAAAAGTTGAATGTTATAGTAAAAAAAAAAAGATATTTATGTATGTACAGTTTGCTAAAGCCAAG 

TTTTGTTTGTATTGATTTCTTTGCATTTATTATAGATATTATAAAATAAAAAAAAAAAAAAAAAAAA 

Human SYNE1 mRNA sequence - var6 (pubHc gi: 21750070) (SEQ ID NO: 188) 

TCAGAGGGTGCTCAATGCTTTCCTGAAAGCTTGTGATGAACTCACCGACATCCTTC 

CAGGGGCTGCAGGAAGCTGTTCGAAAGCTCCACAAACAATGGAAGGATCTTCAAGGAGAAGCCCCTTATC 
ATTTGCTTCATCTGAAGATTGATGTGGAGAAGAATAGGTTCTTAGCCTCTGCAGAAGAATGCAGAACTGA 
GCTGGATCGAGAGACCAAGCTGATGCCCC^GGAAGGCAGTGA7UVAGATAATTAAAGAGCACAGGGTTTTC 
TTGAGTGACAAAGGTCCTGATCATCTCTGTGAGAAJ^ 

TCCCAGTGCGGGACCCAGTAAGGGACACACCTGGAACCTGTCACGTGACTCTCAAAGAGCTCAGAGCTGC 
CATTGACAGCACCTACAGGAAGCTCATGGAAGACCCAGAC^GTGGAAGGACTACACTAGCAGATTCTCT 
GAGTTCTCATCTTGGATATCTACAAATGAGAGACAATT^ 

CCAACCACGGAGAGGTTAAACGTGCCGTTGAAGAGATCAGAAATGGTGTTACCAAAAGGGGTGAGACCCT 
CAGCTGGCTGAAATCCAGGCTGAAAGTTTTGACAGAAGTTTCTTCTGAGAATGAAGCCCAAAAGCAGGGA 
GATGAGCTGGCAAAATTATCCAGCTCTTTC^GGCTCTTGTGACGCTGCTGTCAGAGGTTGAAAAGATGC 
TAAGCAATTTTGGGGACTGTGTCCAGTACAAAGAAATAGTCAAA 

CTCTAAAGAAGTCCAGGAACAAGCTGAGAAGATCTTGGATACTGAAAATCTGTTTGAAGC^ 
CTTCHTC^TCACCAGGAAAAGACAAAGCGGATCTC^GC^ 

AGGCGCAGCAGGGAGAAGGGGGGCTGCCTGACCGAGGCCACGAGGAGCTGCGG7VAGCTGGA:GAGCACACT 
GGATGGCCTGGAGCGCAGCCGGGAGAGGCAGGAACGCCGCATCCAGGTCACATTAAGAAAATGGGAGCGA 
TTTGAAACAAACAAAGAAACAGTAGTAAGATACCTTOT^ 

GTTTTAGCAGTTTGGAAAGTTTATCTTCAGAACTGGAACAAACAAAGGAGTTTTCT 

TATTGCAGTCCAGGCTGAGAACCTTGTAAAGGAAGCnTCAGAGATACCGCTTGGGCCCCAAAATAAGCAG 
CTGCTTCAACAGCAGGCCAAGTCAATCAAAGAACAAGT 

ATGTGATTGACAAGTCCTAAACTTTCTTCTCTGAGATAAAGTTTCATACAATCTTTCCTGTACCTTGTAT 

TCAAAACACTCTTAAAATCTC^AAGTGTCTGTGTATTT 

AAAGAAAGTATCGCTAATACAGAAACGAATATCTATA 

TGCATCTTAAACTGATCATGTTCATGAGAAAGCCATA 

AGGGAATCTTGAAAAAGAACTAATATTTAAAATAATTTT^ 

AGCGAAAAGGAGATATTTTGTTAGTGTAGATTCCAGGCCTAA 

AACCTGAAGAAGCTCCTGGAGCTTGTTTACAGTGCCTC^ 

TCCAGAAATTAACTTTAAAATATTTTATTTTTAAACTTTTAATGTTTGTTTATCTG 

Human SYNE1 mRNA sequence - var7 (public gi: 28192521) (seq id no 

CATATACAGCTTGATAAATCTCTTCCTGCACCTC 
TGGCCCTGAGAGAGGAAATAACCGTTC^VACAGGTCCACGA 

TGAGCAACATAAGAGAAAATGCCGGACAATGATGGATCTGCTTCAAAACACGGATGC 
TTCCATGAAATCTACCGGACC^GGTCTGTTAACGGGATTCGAGTGCCACCT 
CCGAGAGGTTTCATTTTGTTTCCCCCACATCAGAGCTACACCTAATGAAAATGGA 
GTACOTTCTGCTCTGACTGCTGGTTCTTGCAGAGTCS^ 

AGAGAGTCAGTGGAGCAGCTTCTACAAAACTACGTGTCTTTTATAGAAAATAGCAAGTT 
ATGAGGTGACATACCAGATCTTGAAAC^GACAGCTGAGATGTATGTCAAAGCAGATGGTTCAGTGGAAGA 

AGCTGAGAATGTGATGAAATTCATGAATGAAACCACCGCT 

AGTGTGAGGAGCATGCTGGAAGAAGTCATCTCTAACTGGGATCGCTATGGCAATA(^GTGGCTAGTCTGC 
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TTTACCTCATTGGATTCAGCAGCATACTGCCATGAACGATGCTGGCAATTTTCTAATTGAAACCTGTGAT 
GAGATGGTTTCCCGTGACCTGAAGCAGC^TTACTGTTGCTAAATGGGCGGTGGAG 

AAGTCAAGCAATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGGAATACACAGACTGTGTTGTTAC 
CCTGTCTGCTTTTGCAACGGAAGCCCATAAGAAACTTTCTGAACCCTTAGAAGTCTCTTTTATGAATGTC 
AAGCTATTAATTCAAGACTTGGAGGATATTGAGCAGAGGGTGCCTGTGATGGATGCCCAATACAAGATAA 
TTACAAAGACAGCACACCTCATTACCAAAGAAAGCCCCCA 

GTCAAAGCTCAAAGAGC^GCTAACCAAGGTCAAAGAATGTTACTCCCCACTCCTTTATGAGT 

CTGTTGATTCCGTTGGAGGAATTAGAAAAGCAGATGACGTCCTTTTATGACTCACTTGGGAAAATCAATG 

AAATTATCACAGTTCTTGAGCGTGAGGCACAATCGAGTGCCCTTTTTAAAOVAAAACATCAGG 

Human SYNE1 1-nRNA sequence - var8 (public gi: 19584384) (seq id no: 190) 

AAGCTATTAATTCAAGACTTGGAGGATATTGAGCAGAGGGTGCCTGTGATGGATGCCCAATACAAGATAA 

TTACAAAGACAGCACACCTCATTACCAAAGAAAGCCCCCA^ 

GTGAAAGCTGAAAGAGCAGCTAACCAAGGTCAAAGA 

CTGTTGATTCCGTTGGAGGAATTAGAAAAGCAGATGACGTCCTTTTATGACTCACTTGGGAAAATCAATG 
AAATTATCACAGTTCTTGAGCGTGAGGCACAATCGA 

AGCTTGTCAAGAAAACTGTAAGAAAACCTTGACACTTATTGAGAAAGGCAGTCAAAGTGTTCA^ 
GTGACCTTGAGCAACGTGTTAAAGCATTTTGATCAGACGAGGCT 

TTGCnTTTCAGAGTATGGTAAAGAAAACTGGAGATTGGAAGAAGCATGTGGAAACCAACAGTCGCTTGAT 

GAAGAAGTTTGAGGAGTCTCGAGCAGAGTTGGAGAAGGTACTGCGGATTGCTCAGGAGGGCCTGGAGGAA 

AAGGGGGATCCAGAGGAGCTCCTGCGGAGACACACTGAGTTTTTCAGTCAGCTGGATCAGAGGGTC 

ATGCTTTCCTGAAAGCTTGTGATGAACTCACCGACATCCTTCCAGAGCAGGAGCAGCAGGGGCTGCAGGA 

AGCTGTTCGAAAGCTCCACAAACAATGGAAGGATCTTCAAGGAGAAGCCCCTTATCATTTGCTTCATCTG 

AAGATTGATGTGGAGAAGAATAGGTTCTTAGCCTCTGTAGAAGAATGCAGAACTGAGCTGGATCGAGAGA 

CCAAGCTGATGCCCCAGGAAGGCAGTGAAAAGATAATTAAAGAGCACAGGGTTTTCTTCAGTGACAAAGG 

TCCTCATCATCTCTGTGAGAAAAGGTTAGAGCTGATC 

CCAGTAAGGGACAC2VCCTGGAACCTGTCACGTGACTCTC 

ACAGGAAGCTCATGGAAGACCCAGACAAGTGGAAGGACTACACTAGCAGATTCTCTGAGTTCTCATCTO 
GATATCTACAAATGAGACACAATTAAAGGGGATCAAGGGTGAGGCCATCGATACTGCCAACCACGGAGAG 
GTTAAACGTGCCGTTGAAGAGATmGAAATGGTGTTACCAAAAGGGGTGAGACCCTCAGCTGGCTGAAAT 
CCAGGCTGAAAGTTTTGACAGAAGTTTCTTCTGAGAATGAAGCCCAAAAGCAGGGAGATGAGCTGGCAAA 

ATTATCGAGCTCTTTCAAGGCTCTTGTGAOT 

GACTGTGTCCAGTACAAAGAAATAGTCAAAAATTCTCTC^ 

AGGAACAAGCTGAGAAGATCTTGGATACTGAAAATCTGTTTGAAGC^ 

GCAAAAGACAAAGCX3GATCTCAGCAAAGAAGAGAGATGTGCAGCAGCAGATCGCGCAGGCGCAGCAGGGA 

GAAGGGGGGCTGCCTGACCGAGGCCACGAGGAGCTGCGGAAGCTGGAGAGC^ 

GCAGCCGGGAGAGGCAGGAACGCCGCATCCAGGTCACATT^ 

AGAAACAGTAGTAAGATACCTTTTTCAAACAGGTTCCAGTCATGAACGCTTCTTGAGTTTTAGCAGTTTG 

GAAAGTTTATCTTCAGAACTGGAACAAACAAAGGAGTTTTCTAAACGGACAGAAAGT 

CTGAGAACCTTGTAAAGGAAGCTTCAGAGATACCGCTTGGGCCCCAAAATAAGCAGCTGCTTCAA 

GGCCAAGTCAATCAAAGAAC^GTCAAAAAATTAGAAGACACGCTTGAAGAAGATATTAAACCCATGG^ 

ATGGTGAA7^CCAAGTGGGATCATTTTGG(^GTAATTTTGAGACTCTGTCCGTCTGGATAACTGAGAAAG 

AAAAAGAACTCAATGCCTTGGAAACTTCGTCATCTGCCATGGACATGCAAATCAGCCAAATTAAGGTCA^ 

AATTCAGGAAATAGAAAGTAAGCTCAG CAGCATTGTAGGATTAGAAGAAGAAGCCCAGT CTTTTGCTCAG 

TTTGTTACCACTGGAGAATCTGCTCGAATTAAAGCCAAGTTGACACAA^ 

TTCXSAGAGCATGCACAGTGTCTGGAAGGAACAATCCTGGGACATTTATCTCA^ 

GAACCTTAGAAAGATCCAGCAATCTGTGTCTGAATTTGAAGATAAACTTGCTGTTCCAATTAAAATATGT 

TCTTCAGCTACAGAAACATACAAAGTTCTTCAAGAACATA 

GCAGCGCGATCACTGCCTTCTCAGCCAGTGCC^ 

TG CGGCTCTACAGCAGC AATACGAGGACATC CTAA 

CTGCTGGCCCACTGGCAGAGGCTAGAGAAAGAACTATCATCCTTTC 

CTAAAGCC^GTTCCCCAGAAATGGACATTTCTGCAGACAGAGTCAAAGTGGAAGGTGAACTTCAGTTAAT 
ACAGGCAAGTT CAAGGAAGTGTGAGGAAGGAAAAAATAAAATG CTTTTTGTTACAGTTACATTATTT AAA 
ATAATAAAATAAATAAAACTTGTAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SYNE1 raRNA sequence - var9 (public gi: 17861377) (seq id no: 191) 

AAGGTAAAGCCACTAGAGAGAAACTGAAAGAAAACATTCTTAAGA 
AAAATATGATTTATAGACCACAGATAGGAATTAAGAGTTTC 
AGGATTATCAAGAGGAAATTGCTATTGCTCAAGAGAACAAAATA^ 
TGCTAAAGCCAGCCATGAAAGCAAAGCATCTGAGATTGAATACAAGCTGGG 

CAGCATCTCCTGGACCTCATTGCAGCCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCCGTGCAGCAGC 
TTGATAAGAACATGAGCAGCOTGAGGACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCAAGCCAATAGT 
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CTACGATTCCTGTAACTCGGJU^GAAATACAGAGAAAGCTTAATGAGCAGCAGGAGCTTCAGAGAGACATA 
GAGAAGCACAGTACAGGTGTTGCATCTGTCCTCAACCTGTGTGAAGTCCT 

GTGCCACTGATGCCGAGTGTGACTCTATACAGCAGGCTACGAGAAACCrGGACCGGCGGTGGAGAAACAT 

TTGTGCTATGTCCATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAAATTTCTGGAT 

GACTATTCACGTTTTGAAGATTGGCTGAAGTCTTCAGAAAGGACAGCTGCTTTTCCCAGCT^ 

^TGATCTATACAGTTGCCAAGGAAGAACTAAAGAAATTTGAGGCTTTCCAGCGACAGGTCCACGAGTC 

• G ACGCAGCTGG AACTGATCAACAAGCAGTACCGCCG CCTGGC CAGGGAGAACCGCACTGATTCAGCATGT 

^^GCCTCAAACAGATGGTTCACG^^ 

TGCGCAGACTCAAGCATTTTATTGGCCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCATTCTGGTCTG 

GCTCACAGAGATGGATCTGC^GCTCACTAATATTGAACATTTTTCTG^ 
AAGCAACTCAAGGCCTTCCAGCAGGAAATTTCACTGAA.CCACAATAAGATT 

GAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCGATCATCGAGGAGGAACTAGATGAGCTCCG 

ACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGAAAGATACCATAAGAAACTGATCCGCCTGCCTCTCCCA - 

GACGATGAGCACGACCTCTCAGACAGGGAGCTGGAGCTGGAAGACTCTGCAGCTCTGTCGGACCTGCACT 

GGCACGACCGCTCTGCAGACAGCCTGCTTTCTCCACAGCCTTCCTCGAATCTCTCCCTCTCGCT 

GCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGTGGGAT 

CACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCTGAGGATGAAGAAG 

GTC^GGATGAC7VAAGATTTCTACCTCCGGGGAGCTGTTGCCTTATCAGATGTAATGATCCCCGAAAGCCC 

TGAGGCCTATGTAAAACTCACAGAAAATGCAATCAAAAATACCTCCGGGGACCACAGTGCCCTAGAGTCA 

CAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCGCTTTCAGATACAGCAAACCGAAAATATCATTC 

GCAGCAAAACTCCCT^CGGGGCCGGAGCTAGACACCAGCTACAAAGGCTACATGAAACTGCTGGGCGAATG 

C^GTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAAOTGAAGGAGGAAGAGGAGAGCCTTCCTGGC 

TTTGTTAACCTGCATAGTACCGAAACCCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCAGGCCC 

AGGCATTGAGCAAGGAGTTGAGGATGAAGCAGAACCTCCAGAAGTGGCAGCA^jTTTAACTCAGACTTGAA 

CAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAAC^GCTCCAGCGTCTGGAACTCAGCACT 

GACATCCAGACCATCGAGCTCCAGATCAAAAAGCTC^ 

CCATCATCCTCTCCATCAATCTCTGCAGCCCTGAGTTCACCCAGGCTGACRGCAAGGAGAGCCGGGACCT 
GCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGC 
CTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGG 
AGAAC^TTGACAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAGATACTTCAGGA 
CCATCACAAACAGCTTATGCAAATAAAGCATGAGCTGTTGGAAT 

GACATGTCTTGCCAACTACTGGTGAATGCTGAAGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTCCATG 
TTATTGGAAATCGGCTCAAACTTCTCTTGAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAGTTATT 

AGACGTGTCAAGTAGTCAGCAGGATTTGTCTTCCTGGT 

GTGAGTCCCACATCAGGAAGGAGCACCCG?^AACAGACAGAAAACGCCACGAGGCAAGTGTAGTCTCTCAC 

AGCCTGGACCCTCTGTCAGC^GTCCACATAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGA 

GCCAGGGCCAGGTCGGTCCGGCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCT^ 

CTCCTGCTCCTCCTCATCGGGCTTGCCTGCCTTGTACCAATGTCAGAGGAAGACTACAGCTGTGCCCTCT 

CCAACAACTTTGCCCGGTCATTCCACCCCATGCTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAA 

GCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTACCAAC ' 

AAGAGGACCTTGATCTTGGCGAAAGCCCTCGGTGTGGGAGCT^ 

AAATTATGGCTTCAGAGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGAA.G 

AAATCTGGTTTGAGACTCTGAACCTTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACT 

CATGAATTCTGGGCCCTTGGCCCATTCTGTGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTC 

CCATGGTGCTGCTCCAACCZATCAGATAAATGACCCTCCCAAGCACCATGTCAGTGTCGTACAATCTACCA 

ACCAACCAGTGCTGAAGAGATTTTAGAACCTTGTAACATAC7VATTTTTAAGAGCTTATATGGCAGCTTCC 

TTTTTACCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTTTGCTTTAGAAGCACAAGCTGTAAATCTAA 

AAGGC^CTTTTTTTTAGAGGTATAftAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTG 

AAAAAAGTTGAATGTTATAGT 

Human SYNE1 mRNA sequence - varlO (public gi: i786i385) (seq id no 

CAAAAATCAGTOTGATCTCGGGAAACCTGGAGAAATTTATTTTCTGTACTCTAATGTTCTTTC^TTTTGG 

TGACCATCAAGGTGCTGGGAGAGGAATTAGATGGCTGTAATTCAAAGTTAATGGAATTAGATGCAGCAGT 

ACAGAAATTCTTGGAACAGAATGGCCAACTGGGTAAGCCACTGGCCAAGAAGATAGGAAAACTGACTG^ 

CTTCACCAGCAGACGATTAGACAAGCTGAGAATCGGCTCTCCAAGCTCAATCAGGGAAC^ 

AAGAATACAATGAAATGCTTGAATTAATTTTGAAGTGGATTGAAAAAGCTAAAGTCTTGGCTC^TGGAAC 

TATTGCATGGAATTCTGCAAGCCAGCTTOTGAAACAATATATTTTGCATCAGACCCTGCTA 
AAAGAAATTGACAGTGAGCTGGAAGCAATGACTGAGAAATTACAGTAC^ 

AAAAAATGTCTCAGCAAGTGGCAGAACTGGGACGGGAGACTGAGGAGTTGCGACAGATGATCAAAATTC 

TTTGC^GAACCTCCAAGATGCAGCTAAGGATATGAAAAAATTTGAAGCAGAGTTGAAAAAGTTACAAGCT 

GCCTTGGAGCAAGCCC^GGCAACACTGACrrTCTCCAGAAGTTGGACGTCTCAGTCTCAAGGAGCA 

CTCATCGGCAGCATTTGTTGTCTGAGATGGAGTCACTGAAGCCGAAGGTGCAAGCAGTGCAGCTCTGCCA 

GAGTGCCCTCCGGATCCCCGAGGATGTGGTTGCCAGCTTACCTCTCTGTCATGCTGCTCTGCGGCTGCA 
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GAAGAGGCGAGCCGGCTGCAGCACACCGCCATCCAGC^^ 

ATGAACAATATGAGCAAGAAATGAAAC^TCTCCAGCAACTGATAGAAGGAGCTCACAGAGAGATTGAGGA 
TAAACCTGTTGCC^CCAGTAACATACAGGAGCTG(^GGCTCAGATTTCTCGGCATGAGGAGCTGGCGCAG 
AAAATTAAGGGCTACCAGGAGCAGATCGCTTCTTTGAATTCCAAGTGCAAGATGCTGACGATGAAAGCCA 
AGC^CGCCACCATGOTGCTGACGGTGACCGAGGTCGAGGGGCTGGCGGAAGGGACAGAGGACCTGGATGG 
GGAGCTCCTCCCCACGCCTTCGGCCCACCCCTCTGTGGTCATGATGACTGCAGGTCGCTGTCACACTTTG 
CTGTC^CCGGTCACTGAGGAGTCTGGGGAGGAGGGAACCAACAGTGAGATTTCCTCTCCACCTGCCTGTC 
GCTCCCCTTCACCTGTGGCTAATACAGATGCTTCTGTTAACC^GGACATTGCATATTACCAAGCCTTGTC 

TGCTGAGAGGTTGCAGACAGATGCTGCAAAAATTCACCC 

CCGGGATTGGAGCC^TCCGCTACTGCCAAACTGGGTGATTTGCAGCGTTCTTGGGAAACCTTAAAGAATG 
TGATCAGTGAGAAGCAGCGC^CACTCTATGAAGCTTTGGAGCGCCAGCAGAAGTACCAGGACTCCCTCCA 
• GTCCATCTCTACGAAGATGGAGGCCATTGAGCTGAAACTCAGTGAGAGCCCAGAGCCTGGCAGGAGTCCA 
GAAAGCCAGATGGCTGAACATCAGGCATTGATGGATGAGATTCTCATGCTCCAGGATGAAATCAATGAGC 

TCCAGTCCTCTCTCGCAGAGGAGCTGGTATCC^^ 

GCAGTCCACGCTCACTGTCTTAGCCGAGCGAATGTCCACCATCAGGATGAAAGCCTCGGGGAAACGGCAG 
CTTTTGGAGGAGAAGTTGAATGATCAGCTGGAGGAACAAAGGCAGGAACAGGCCCTGCAGAGGTATCGCT 
GTGAAGCCGATGAGCTGGACAGCTGGCTCTTGAGTACCAAGGCCACTCTGGACACTGCGCTGAGTCCACC 
CAAGGAGCCCATGGACATGGAGGCCCAGCTTATGGACTGCCAGAATATGCTGGTGGAAATAGAGCAGAAG 

GTGGTGGCTTTATCAGAACTGTCAGTCCACAATGAGA^ 

ACGAGGCCGAGCAGCTGGCTGGAAAGCTGAGAAGGCTCAAGGGGAGCCTGCTGGAGCTGCAGAGAGCCCT 
GCATGATAAGCAGCTCAACATGCAGGGAACAGCACAGGAGAAGGAGGAGAGCGATGTTGACCTAACAGCC 
ACGCAGAGCCCCGGCGTCCAGGAATGGCTGGCCCAAGCTCGCACCACATGGACCCAGCAGCGGCAGAGCA 
GTCTCCAGCAACAAAAAGAGTTAGAACAGGAATTAGCCGAGCAGAAGAGTCTCCTTCGCTCAGTAGCCAG 
TCGTGGAGAGGAGATTCTAATTCAACATTGGGCGGCAGAGACCTCTGGTGATGCTGGCGAAAAACCTGAT 
GTGTTATCCCAGGAGTTGGGGATGGAAGGGGAGAAATCATCCGCTGAAGACCAGATGAGAATGAAATGGG 
AAAGCCTACATCAAGAATTTAGTACCAAGCAGAAACTACTACAGAATGTTCTGGAACAGGAACAAGAGCA 
AGTGCTTTATAGCAGGCCAAATCGACTCTTGTCTGGTGTGCCACTGTACAAAGGGGACGTGCCAACCCAA 
6ATAAATCTGCAGTTACATCTTTGCTGGATGGACTGAACCAAGCCTTCGAGGAGGTTTCATCCCAGAGTG 
GAGGGGCAAAGAGGCAGAGTATACACTTGGApCAGAAGTTGTATGATGGAGTCTCAGCCACCTCTACTTG 
GTTGGATGACGTTGAAGAACX5TTTATTTGTTGCCACAGCACTTTTACCAGAAGAAACAGAGACTTGTCTC 

TTCAACCAAGAGATTCTTGCCAAAGACATTAAGGAAATGTC^^ 

TTTCCCAAGCTTTTCCAGAGAATGGTGATAATCGAGATGTTATTGAAGATACTTTGGGTTGTCTTTTGGG 
CAGGTTATCCTTGCTAGACTCAGTAGTGAATCAACGATGTCATCAGATGAAAGAAAGACTTCAGCAAATA 

CTAAATTTCCAGAATGATCTGAAAGTGCTC^ 

AACTGGCAAATGTGTTTGAACAGCCCGTAGCAGAACAAATAGAGGCAATACAAC^GGCTGAAGATGGACT 

CAAAGAATTTGATGCAGGAATCATTGAATTAAAGAGGCGTGGTGACGAGCTACAGGTCGAGCAGCCGTCC 

ATGCAAGAACTCTCCAAGCTCCAGGACATGTATGATGAGCTGATGATGATCATTGGCTCCCGGAGGAGTG 

GTCTGAATCAGAACCTTACACTCAAGAGTCAGTATGAGAGGGCCCTACAAGATCTGGCTGACCTGCTAGA 

AACTGGTCAGGAGAAGATGGCAGGAGACCAGAAAATCATCGTGTCTTCC7VAAGAGGAAATCCAGCAACCA 

CTTGACAAACATAAGGAATACn^TCAGGGCCTGGAATCTCATATGATCTTGACTGTAAC 

AGATAATCAGCTTTGGAGTCCAAAAGGAAACCCAGTTCCATACAGAGCTGATG 

ACTGAAACGGGCTCACAAGAGGGGTGTGGAGCTGGAGTACATTCTAGAGACGTGGTCCCATCTGGATGAG 
GACCAGCAGGAGCTCAGCAGACAGCTGGAGGTGGTGGAAAGCAGCATCCCAAGCGTGGGTCTGGTGGAGG 
AGAACGAGGACAGGCTTATTGACCGCATAACACTCTACCAGCATTTAAAATCTAGCCTTAATGAATACCA 
GCCCAAATTATATCAAGTATTAGATGATGGGAAACGACTTCTGATATCCATCAGCTGCTCAGATCTAGAA 
AGCCAACTAAATCAACTTGGAGAGTGCTGGCTAAGTAACACCAATAAAATGTCTAAGGAACTTCACAGAC 
TGGAAACAATATTGAAACACTGGACCAGATATCAAAGTGAATCT 

TGCAAAAGACCGGCTAGAATTTTGGACTCAGCAATCTGTGACAGTCCCACAAGAGCTGGAAATGGTCCGT 
GATCATCTAAATGCTTTCCTGGAGTTTTCTAAAGAAGTGGATGCCCAATCTTCCCTGAAATCATCTGTTC 
TGAGTACTGGAAATC^GCTCCTTCGACTAAAAAAGGTGGACACAGCCACGCTGCGCTCTGAGTTGTCGCG 
CATTGATAGCCAGTGGACTGACCTGCTAACCAAra^ 

ATGGATAAACTGCCTTCCCGCCATGCCATTTCTGAAGTCATGAGTTGGACTTCTCTAATGGAAAATGCTA 
TTCAGAAGGATGAAGATAATATTAAAAATTCCATAGGTTACAAGGCAATTCATGAATACCTTCAGAAATA 
TAAGGGTTTTAAGATAGACATTAACTGTAAACAGCTGAC^GTGGATTTTGTGAACC^GTCCGTGCTAC^ 
ATCAGCAGTCAGGATGTGGAAAGTAAGCGTAGTGATAAGACTGA^ 

ATAAAAGTTGGCAAATTCTGCAAGGTCTAGTAACTGAGAAGATCCAGCTGTTGGAAGGCTTATTGGAATC 
TTGGTCAGAATATGAAAATAATGTACAATGTCTGAAAACATGGTTTGAAACCC^GGAAAAGAGACTAAAA 
CAACAGCATCGAATTGGAGATCAGGCTTCTGTTCAAAATGCACTGAAAGACTGTCAGGATCTGGAAGATC 
TGATTAAAGCAAAAGATAAAGAAGTAGAGAAAATTGAGCAGAATGGACTTGCTTTGATTCAGACCAAGAA 
AGAAGACGTCTCTAGCATTGTCATGAGCACACTXSCGAGAGCTCC^ 

ATGGTTGGLACAATTAAAGATACTGCTGAAATCAGTGCTTGACCAATGGAGTAGTCACAAAGTGGCCTTTG 
AC^GATA^(^GTTACCTCATGGAGGCC^GATACTCTCTTTCCCGATTCCGTCTGCTGACTGGCTCCTT 
AGAAGCTGTGCAAGTTCAGGTGGACAATCTTCAGAATCTCCAAGATGATCTGGAAAAACAGGAAAGGAGC 
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TTACAGAAATTTGGCTCTATCACCAACC^TTATTAAAAGAGTGTCACCCACCCGTGACAGAAACTCTTA 
CCAATACACTGAAAGAAGTCAACATGAGATGGAATAACTTGCTGGAAGAGATTGCTGAGCAGCTACAGTC 
CAGCAAGGCCCTACTTGAGCTTTGGCAAAGATACAAGGACTACTCCAM 

CAG<^GGAGGATCGAACCAATGAGCTGTTGAAGGCAGCCACAAACAAGGAC^TTGCCGATGATGAGGTTG 
CCAC^TGGATTCAAGATTGCAACGACCTCCTCAAAGGACTGGGCACAGTTAAAGATTCCCTCTTTGTT 
CCATGAGCTGGGAGAGCAACTGAAGCAACAAGTGGATGCTTCCGCAGCATCAGCTATTCAATCGGATCAA 
CTCTCTTTGAGTCAACACTTGTGTGCCCTGGAGCAAGCTCTCTC 

GAGTTCTTGATTATGAAACCTTTGCCAAGAGTTTAGAAGCTTTGGAGGCCTGGATAGTGGAAGCTGAAGA 

AATACTACAAGGGCAGGACCCTAGC(^CTCATCTGACCTCTCCAC^TCCAGGAAAGGATGGAAGAACTT 

AAGGGACAGATGTTAAAATTCAGCAGCATGGCTCCAGATTTA 

TACCCTTGAATGATAAGGAAATCAAAAGAATGCAGAATCTGAACC^ 

GACTACAGAAAGATTCAGCAAGTTGCAGTCATTTTTGCT 

ACATGGATGGAATTCCTAGTTC7VGACAGAACAAAAGTTAGCAGTAGAGATTTCAGGAAATT ATC AGCACC 
TTTTGGAACAGCAGAGAGCACACGAGTTGTTTCAAGCC^^ 

AATCATTATTGATGGGCAACGTCTTCTAGAACAAGKSTCAAGTTGATGACAGGGATGAAT^ 

TTGACACTCCTCI\GTAATCAATGGCAGGGAGTGATTCGCAGGGCCCAGCAGAGGCGGGGGATCATTGACA 

GCCAGATTCGCCAGTGGCAGCGCTATAGGGAGATGGCAGAAAAGCTTCGTAAATGGTTGGTTGAAGTGTC 

CTACCTCCCC^TGAGTGGTCTCGGAAGTGTTCCTATACCACTGCAACAAGCAAGGACCCTCTTTGAT 

GTGCAGTTCAAAGAAAAAGTGTTTCTGCGGCAAGAAGGGAGCT 

AACTCCTTCTCTCGGCGGACAGTGGCGCTGAGGCCGCCTTGCAGGCCGAACTCGCTGAAATCCAAGAGAA 

ATGGAAATCAGCCAGCATGCGGCTGGAAGAACAGAAGAAAAAACTAGCCTTCTTGTTGAAAGACTGGGAA 

AAATGTGAGAAAGGAATAGCAGATTCCCTGGAGAAACTACGAACTTTCAAAAAGAAGCTTTCGCAGTCTC 

TCCCGGATCACCATGAAGAGCTCCATGCAGAACAAATGCGTTGCAAGGAATTAGAAAATGCAGTTGGGAG 

CTGGACAGATGACTTGACCCAGTTGAGCCTGCTGAAGGACACCCTC 

ATCTCCATTCTTAATGAACGCGTAGAGCTTCTGCAAAGGCAGTGGG 

TAAGGCGGCAGCAAATAGGTGAAAGATTGAATGAATGGGCAGTCTTCAGTGAAAAGAACAAGGAACTCTG 
TGAGTGGTTGACTCAAATGGAAAGCAAAGTTTCTCAGAATGGAGACATTCTCATTG/^AGAAATGATAGAG 
AAGCTCAAGAAGGATTATCAAGAGGAAATTGCTATTGCTCAAGAGAACAAAATACAGCTC^ 
GAGAACGACTTGCTAAAGCCAGCCATGAAAGCAAAGCATCT^ 

CGACCGGTGGCAGCATCTCCTGGACCTCATTGCAGCCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCC 

GTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTCAGA 

AGCCAATAGTCTACGATTCCTGTAACTCGGAAGAAATACAGAGAAAGCTTAATGAGCAGCAGGAGCTTCA 

GAGAGACATAGAGAAGCACAGTACAGGTGTTGCATCTGTCCTCAACCTGTGTGAAGTCCTGCTGCACGAC 

TGTGACX5CCTGTGCCACTGATGCCGAGTGTGACTCTATACAGCAGGCTACGAGAAACCTGGACCGGCGGT 

GGAGAAACATTTGTGCTATGTCGATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAA 

ATTTCTGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCTTCAGAAAGGACAGCTGCTTTTCCCAGC 

TCTTCTGGGGTGATCTATACAGTTGCCAAGGAAGAACTAAAGAAATTTGAGGCTTTCCAGCGACAGGTCC 

ACGAGTGCCTGACGCAGCTGGAACTGATCAAC^AGCAGTACCGCCGCCTGGCCAGGGAGAACCGCACTGA 

TTGAGCATGTAGCCTCAAACAGATGGTTCACGAAGGCAACCAGAGATGGGAGAACCTGCAA 

ACCTCCATCTTGCGCAGACTCAAGCATTTTATTGGCCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCA 

TTCTGGTCTGGCTCACAGAGATGGATCTGCAGCTCACTAATATTGAACATTTTTCTGAGTGTGATGTTCA 

AGCTAAAATAAAGCAACTCAAGGCCTTCCAGCAGGAAATTTCACTGAACCACAATAAGATTGAGCAGATA 

ATTGCCCAAGGAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCGATCATCGAGGAGGAACT 

ATGAGCTCCX3ACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGAAAGATACCATAAGAAACTGATCCGCCT 

GCCTCTCCGAGACGATGAGCACGACCTCTCAGACAGGGAGCTGGAGCTGGAAGACTCTGCAGCTCTGTCG 

GACCTGCACTGGCACGACCGCTCTGCAGACAGCCTGCTTTCTCCACAGCCTTCCTCCAATCTCT 

CGCTCGCTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCT 

GGAGTGGGATCACGACTATGACCTC^GTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCTG 

GATGAAGAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGA 

CCCTAGAGTCACAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCGCTTTCAGATACAGCAAACCGA 

AAATATCATTCGCAGCAAAACTCCGA.CGGGGCCGGAGCTAGACACCAGCrACAAAGGCTACATGAAACTG 

CTGGGCGAATGCAGTAGCAGTATAGACTCCX3TGAAGAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGA 

GCCTTCCTGGCOTTGTTAACCTGCATAGTACCGAAACCC^AACGGCrrG 

TCTC(^GGCCGAGGCATTGAGCAAGGAGTTGAGGATGAAGGAG 

TCAGACTTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGG 
AACTCAGCACTGACATCGAGACCATCGAGCTCCAGATCAAAAAGCTCAA^ 

CCACCGCAAAGCCATCATCCTCTCCATCAATCTCTGCAGCCCTGAGTTCACCCAGGCTGACAGC7^AGGAG 
AGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAATC 

AGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAATGAGCCA^ 

TCTTATGCTGGAGAAC^TTGACAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAG 

ATACTTCAGGACCATCACAAACAGCTTATGCAAATA^ 

CCTCTTTGCAAGACATGTCTTGCCAACTACTGGTGAATGCTGAAGGAACAGACTGTTTAGAAGC 
AAAAGTCCATGTTATTGGAAATCGGCTCAAACTTCTCTTGAAGGAGGTCAGTCGTCATATCAAGGAACTG 
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GAGAAGTTATTAGACGTGTC^GTAGTCAG(^GGATTTGTCTTCCTGGTCTTCTGCTGATG 
CCTCAGGGTCTGTGAGTCCCACATCAGGAAGGAGCACCCCAAACAGACAGAAAACGCCACGAGGCAAGTG 
TAGTCTCTCACAGCCTGGACCCTCTGTCAGCAGTCCACATAGCAGGTCCACAAAAGGTGGCTCCGATTCC 
TCCCTTTCTGAGCCAGGGCCAGGTCGGTC 

CCCTTCAGCTTCTCCTGCTCCTCCTCATCGGGCTTGCCTGCCTTGTACCAATGTCA 
^CTGTGCC^TCTCCAACAACTTTGCCCGGT 
CTCTGAACTA&GCAGATGCCATCTGCAGAAGTGCTGGTJ^ 
'^Ai^CTACCAACAAGAGGACCTTGATCTTGGCGAAAGCCCTCGGTGTGGCAGCTTTAGCCCT 

CACATGTGTGCAAATTATGGCTTCAGAGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGA 

AGGTTTGGAAGAAATCTGGTTTGAGACTCTGAACCT 

TTCCCCGGACTCATGAATTCTGGGCCCTTGGCCCATTCXGTGCACAGCC^GGACTTCAGTAGACCATCT 
GGGCAGCTTTCCC^TGGTGCTGCTCCAACGATCAGATAA^^ 

ACAATCTACCAACC7VACCAGTGCTGAAGAGAT^ ' 
TGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTTTGCTTTAGAAGCACAAGC 
TGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATAAGATCATGGAA 
GGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE1 raRNA sequence - varll (public gi: 17227153) (seq id no: 193) 

AACTCCTTCTCTCGGCGGACAGTGGCGCTGAGGCCGCCTTGCAGGCCGAACTCGCTGAAATCCAAGAGAA 
ATGGAAATCAGCCAGCATGCGGCTGGAAGAACAGAAGAAAAAACTAGCCTTCTTGTTGAAAGACTGGGAA 
AAATGTGAGAAAGGAATAGCAGATTCCCTGGAGAAACT^^ 
TCCCGGATCACGATGAAGAGCTCCATGCAGAACAAAT 

CTGGACAGATGACTTGACCCAGTTGAGCCTGCTGAAGGACACCCTCTCTGCCTATATCAGTGCTGATGAT 

ATCTCCATTCTTAATGAACGCGTAGAGCTTCTGCAAAGGCAGTGGGAAGAACTATGCCACCAGCTCTCCT 

TAAGGCGGCAGCAAATAGGTGAAAGATTGAATGAATGGGCAGTCTTCAGTGAAAAGAACAAGGAACTCTG 

TGAGTGGTTGACTCAAATGGAAAGC?^AAGTTTCTCAGAATGGAGACATTCTCATTGAAGAAATGATAGAG 

A^GCTCAAGAAGGATTATCAAGAGGAAATTGCTATTGCTCAAGAGAACAAAATACAGCTCCAACA^ 

GAGAACGACTTGCTAAAGCCAGCCATGAAAGCAAAGCATCTGAGATTGAATACAAGCTGGG 

CGACCGGTGGCAGCATCTCCTGGACCTCATTGCAGCCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCC 

GTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCA 

AGCCAATAGTCTACGATTCCTGTAACTCGGAAGAAATACAGAGAAAGCTTAATGAGCAGCAGGAGCTTCA 

GAGAGACATAGAGAAGCACAGTACAGGTGTTGCATCTGTCCTCAACCTGTGTGAAGTCCTGCTGCACGAC 

TGTGACGCCTGTGCGACTGATGCCGAGTGTGACTCTAT^ 

GGAGAAACATTTGTGCTATGTCGATGGAAAGGAGGCTGAAAATC 

ATTTCTGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCTTCAGAAAGGACAGCTGCTTTTCCCAGC 
TCTTCTGGGGTGATCTATACAGTTGCCAAGGAAGAACT^ 

ACGAGTGCCTGACGCAGCTGGAACTGATCAACAAGC^GTACCGCCGCCTGGCCAGGGAGAACCGCACTGA 
TTCAGCATGTAGCCTCAAACAGATGGTTCACGAAGGCAACCAGA 

ACCTCC^TCTTGCGCAGACTC^AGCATTTTATTGGCCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCA 
TTCTGGTCTGGCTCAGAGAGATGGATCTGCAGCTCACT/^ATATTGAACATTTTTCTGAGTGTGATGTTCA 
. AGCTAAAATAAAGCAACTCAAGGCCTTCCAGCAGGAAATTTCACTGAACCACA 

ATTGCCCAAGGAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCX5ATCATCGAGGAGGAACTAG 
ATGAGCTCCGACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGAAAGA 
GCCTCTCCCAGACGATGAGCACGACCTCTCAGACAGGGAGCTGG^^ 
GACCTGCACTGGC^CGACCGCTCTGCAGACAGCCTGCTTTCT 

CGCTCGCTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCT 
GGAGTGGGATCACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCTGAG 
GATGAAGAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGAGCTGTTGCCTTATCAGGGGACCACAGTG 
CCOTAGAGTCACAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCX3CTTTCAGATACAGCAAACCGA 
AAATATCATTCGCAGCAAAACTCCCACGGGGCCGGAGCTAGAC^ 

CTGGGCGAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGA 

GCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAAACCCAAACGGCTGGTG 

TCTCCAGGCCCMGCATTGAGCAAGGAGTTGAGGATGAAGCAGAACCTCCAGAAGTGG 

TCAGACTTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGG 

AACTCAGCAOTGACATCGAGACCATCGAGCTCCAGATCAAAAAGCTCAAGGAGCT 

CCACCGGAAAGCCATCATCCTCTCCATCAATCT 

AGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTOTCTGC 

AGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTC 

TCTTATGCTGGAGAACATTGACAGAAGGAAAAATGAAATTG 

ATACTTCAGGACCATCACAAACAGCTTATGCAAATAAAGCATGAGCT C CCAACTCAGAGTAG 

CCTCTTTGCAAGACATGTCTTGCGAACTACT 
AAAAGTCCATGTTATTGGAAATCGGCTCAAACTTCTCT 
GAGAAGTTATTAGACGTGTCAAGTAGTCAGCAGGATTTGT 
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CCTCAGGGTCTGTGAGTCCCACATC^^ 
TAGTCTCTCACAGCCTGGACCCTCTGTCAGCAGTCCAC^ 

TCCCTTTCTGAGCCAGGGCCAGGTCGGTCCX3GCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTC 
CCCTTCAGCTTCTCCTGCTCCTCCTCAT^ 

CTGTGCCCTCTCCAACAACTTTGCCCGGTCATTCCACCCCATGCTCAGATACACGAATGGCCCTCCTC^ 

CTCTGAACTAAGCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGGATCGGGTCATAAGCAATCCC 

AAACTACCAACAAGAGGACCTTGATCTTGGCGAAAGCCCTCGGTGTGGCAGCTTTAGCCCTCCTCCAGAT 

CACATGTGTGCAAATTATGGCTTCAGAGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGA 

AGGTTTGGAAGAAATCTGGTTTGAGACTCTGAACCTTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAG 

TTCCCCGGACTC^TGAATTCTGGGCCCTTGGCCC^TTCTGTGCACAGCCAAGGACTTraGTAGACCATCT 

GGGCAGCTTTCCCATGGTGCTGCTCC^CCATCAGATAAATGACCCTCCCAAGCACCATGTCAGTGTCGT 

ACAATCTACCAACCAACCAGTGCTGAAGAGATTTTAGAACCTTGTAACATACAA 

TGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTTT^ 

TGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATAAGATCATGGAA 
GGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE1 mRNA sequence - varl2 (public git lessoies) (seq id no: 194) 

ACAAAAGAGCATTCCATGAAATCTACCGGACCAGGTCTGTTAACGGGATTCCAGTGCCACCTGATCAATT 
AGAGGACATGGCCGAGAGGTTTCATTTTGTTTCCTCCACATCAGAGCT^ 

TTAGAATTAAAGTACCGTCTGCTCTCACTGCTGGTTCTTGCAGAGTCAAAGCTGAAGTCTTGGATCATTA 
AGTACGGGAGGAGAGAGTCAGTGGAGCAGCTTCTACAAAACTACGTGTCTTTTATAGAAAATAGCAAGTT 

CTTTGAACAATATGAGGTGACATACCAGATCTTGA 

TCAGTGGAAGAAGCTGAGAATGTGATGAAATTCATGAATGAAACCACCGCTCAGTGGAGGAATCTCTCAG 
TAGAAGTGAGGAGTGTGAGGAGCATGCTGGAAGAAGTGATCTCTAACTGGGATCGCTATGGCAATACAGT 
GGCTAGTCTGCAAGCCTGGCTAGAGGATGCTGAAAAAATGCT 

TTTTTTCGAAATTTACCTCATTGGATTC^GCAGCATACTGCC^TGAACGATGCTGGCAATTTTCTAATTG 
AAACCTGTGATGAGATGGTTTCCCGTGACCTGAAGCAGCAATTACTGTTGCTAAATGGGCGGTGGAGGGA 
'GTTGTTTATGGAAGTCAAGCAATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGGAATACACAGAC 
TGTGTTGTTACCCTGTCTGCTTTTGCGACGGAAGCCCATAAGAAACTTTCTGAACCCTTAGAAGTCTCTT 
TTATGAATGTCAAGCTATTAATTCAAGACTTGGAGGATO 

ATACAAGATAATTACAAAGACAGCACACCTCATTACCAAAGAAAGCCCCCAAGAAGAAGGAAAAGAAATG 
TTTGCGACCATGTCAAAGCTCAAAGAGCAGCTAACCAAGGTCAAA 
AGTCTCAGCAGCTGTTGATTCOSTTGGAGGAA^ 
GAAAATG^TGAAATTATCACAGTTCnTGAGCGTGAGGCA^ 

CAGGAACTGTTAGCTTGTCAAGAAAACTGTAAGAAAACCTTGACACTTATTGAGAAA 
TTCAAAAGTTTGTGACCTTGAGCAACGTGTTAAAGCATTTTGAT 

AGATATTCATGTTGCTTTTCAGAGTATGGTAAAGAAAACTGGAGATTGGAAGAAGCATGTGGAAACCAAC 
AGTCGCTTGATGAAGAAGTTTGAGGAGTCTCGAGCAGAGTTGGAGAAGGTACTGCGGATTGCTCAGGAGG 
GCCTGGAGGAAAAGGGGGATCCAGAGGAGCTCCTGCGGAGACACACTGAGTTTTTCAGTCAGCTGGATCA 
GAGGGTGCTCAATGCTTTCCTGAAAGCTTGTGATGAACTCACCGACATCCTTCCAGAGCAGGAGCAGCAG 
GGGCTGCAGGAAGCTGTTCGAAAGCTCCACAAACAATGGAAGGT^ 

GCATCCTCAATGAAGGGAGAAGCTTGAGCGTGTTAAGGTCCAAATGTAAGAGAAATTTAGAAATTCCTGG 

AAAGTCACTGTAACTATTTCGCT'CATTTAAAAACT 

TG 



Human SYNE1 mRNA sequence - varl3 (public gi: 16553949) (seq id no: 195) 

atagtagaattattcattataatatttggct^ 

tttattttctgtactctaatgttctt^ 

tgtaattcaaagttaatggaattagatgcag cagtacagaaattcttgg aacagaatggc caactgggt a 
agccactggccaagaagataggaaaactgactgaacttcaccagcagaccattagacaagctgagaatcg 
gctctccaagctcaatcaggcagcatcacatttagaagaatacaatgaaatgcttg7uvttaattttg7^ag 
tggattgaaaaagctaaagtcttggctcatggaactattc 

aatatattttgcatcagaccctgctagaagaatccaaagaaattgacagtgagctggaagcaatgactga 
gaaattacagtacctcactagcgtgtactgtacagaaatu^tgtctcagcaa 
gagactgaggagttgcgacagatgatcaaaattcgtttgcagaacctccaagatgcagct 
aaaaatttgaagcagagttgaaaaagttacaagctgcct 

agaagttggacgtctcagtctcaaggagcagctctctcatcggcagca.'tttgttgtctgagatggagtca 
ctgaagccgaaggtggaagcagtgcagctctgccaga 

gcttacctctctgtcatgctgctctgcggctgcaggaagaggccagccggctgcagctvcaccgccatcm 
gcagtgtaacatcatgcaggaagctgtggtacaatatg^ 

caactgatagaaggagctcacagagagattgaggataaacctgttgccaccagtaacatacaggagctgc 
aggctcagatacacgaatggccctcctccactctgaactaagcagatgccatctgcagaagtgctggtag 
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CATAAGGAGGATCGGGTC^TAAGC^TCCC 

CGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAATTATGG 

CAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCTGGT^ 

ACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACT^ 

TGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTC^ 

TGACCCTCCCAAGCACCATGTCAGTGTCGTAC^^ 

CTTGTAACATACAATTTTTAAGAGCTT^^ 

TGTTTTAACCTTTGCITTAGAAGCACA^ 

AAACTAGATGTAATAAATAAGATC^TGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGTAAAAAAAA 

AAGATATTTATGTATGTACAGTTTGCTAAAG 

GATATTATAAAAT 

Human SYNE1 ibRNA sequence - varl4 {public gi: 12698056) (seq id no: 196) 

ACAACGGAAACTTGAGCAACATAAGGATCTGCTTCAAAACACX3 

ATCTACCGGACGAGGTCTGTTAACGGGATTCCAGTGCCACCTGATCAATTAGAGGACATGGCCGAGAGGT 
TTCATTTTGTTTCCTCCACATCAGAGCTACACCTAATGAAAATGGAATTTTTAGAATTAAAGTACCGTCT 
GCTCTCACTGCTGGTTCTTGCAGAGTCAAAGCTGAAGTCTTGGATCATTAAGTACGGGAGGAGAGAGTCA 
GTGGAGCAGCTTCTACAAAACTACGTGTCTTT^^ 

CATACCAGATCTTGAAACAGACAGCTGAGATGTATGTCAAAGCAGATGGTTCAGTGGAAGAAGCTGAGAA 
TGTGATGAAATTCATGAATGAAACCACCGCTCAGTGGAGGAATCTCTCAGTAGAAGTGAGGAGTGTGAGG 
AGCATGCTGGAAGAAGTGATCTCTAACTGGGATCGCTATGGCAATACAGTGGCTAGTCTGCAAGCCTGGC 
TAGAGGATGCTGAAAAAATGCTCAATCAATCAGAAAATGCCAAAAAGGATTTT 

TTGGATTCAGCAGI^TACTGCC^TGAACGATGCTGGCAATTTTCTAATTGAAACCTGTGATGAGATGGTT 
TCCCGTGACCTGAAGCAGCAATTACTGTTGCTAAATGGGCGGTGGAGGGAGTTGTTTATGGAAGTCAAGC 
AATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGGAATACACAGACTGTGTTGTTACCCTGTCTGC 
TTTTGCAACGGAAGCCCATAAGAAACTTTCTGA^ 

^TTCAAGACTTGGAGGATATTGAGCAGAGGGTGCCTGTGATGGATGCCCAATACAAGATAATTACAAAGA 
CAGCACACCTCATTACCAAAGAAAGCCCCCAAGAAGAAGGAAAAGAAATGTTTGCGACCATGTCAAAGCT 
CAAAGAGCAGCTAACGAAGGTCAAAGAATGTTACTCCC^ 

CCGTTGGAGGAATTAGAAAAGCAGATGACGTCCTTTTATGACTCACTTGGGAAAATCAATGJ^ 

CAGTTCTTGAGCGTGAGGCAGAATCGAGTGCCCTTT 

AGAAAACTGTAAGAAAACCTTGACACTTATT^^ 

AGCAACGTGTTAAAGCATTTTGATCAGACGAGGCTACAAAGACAGATTGCAGATATTC^ 

AGAGTATGGTAAAGAAAACTGGAGATTGGAAGAAGCATGTGGAAACCAACAGTCGCTTGATGAAGAAGTT 

TGAGGAGTCTCGAGCAGAGTTGGAGAAGGTACTGCGGATTGCTCAGGAGGGCCTGGAGGAAAAGGGGGAT 

CCAGAGGAGCTCCTGCGGAGACACACTGAGTTTTTCAGTCAGCTGGATCAGAGGGTGCTCAATGCTTTCC 

TGAAAGCTTGTGATGAACTCACCGAC^TCCTTCCAGAGCAGGAGCAGCAGGGGCrGCAGGAAGCTGTTCG- 

AAAGCTCCACAAACAATGGAAGGATCTTCAAGGAGAAGCCCCTTATCATTTGCTTCATCTGAAGATTGAT 

GTGGAGAAGAATAGGTTCTTAGCCTCTGTAGAAGAATGCAGAACTGAGCTGGATCGAGAGACCAAGCTGA 

tgccccaggaaggcagtgaaaagataattaaagagcacagggttttcttcagtgacaaaggtcctcatca 
tctctgtgagaaaaggttacagctcatcgaggaactctgtgtgaaactcccagtgcgggacccagtaagg 
gacacacctggaacctgtcacgtgact ctcaaagagctcagagctgccattgacag cacctacaggaagc 
tcatggaagacccagacaagtggaaggactacactagcagattctctgagttctcatcttggatatctac 
aaatgagacacaattaaaggggatcaagggtgaggccatcgatactgccaaccacggagaggttaaacgt 

gccgttgaagagatcagaaatggtgttaccaaaaggggtgagacc 
aagttttgacagaagtttcttctgagaatgaagcccaaaagcagggagatgagctggcaaaa 

ctctttcaa6gctcttgtgacgctgctgt<^ga 

CAGTACAAAGAAATAGTCAAAAATTCTCTCGAAGAATTAATTTCTGGCTCrrAAAGAAGTCCAGGAACAAG 
CTGAGAAGATCTTGGATACTGAAAATOTGTTTGAAGCACAGC^GTTAOT 

AAAGCGGATCT CAGCAAAGAAGAGAGATGTGCAGCAGCAGATCGCGCAGG c6cAGCAGGGAGAAGGGGGG 

CTGCCTGACCGAGGCCACGAGGAGCTGCGGAAGCTGGAGAGCACACTGGATGGCCTGGAGCGCAGCCGGG 

AGAGGCAGGAACGCCGCATCCAGGTCACATTAAGAAAATGGGAGCGATTTGAAACAA 

AGTAAGATACCTTTTTCAAAGA.GGTTCCAGTCATGAACGCTTCTTGAGTTTTAGCAG 

TCTTCAGAACTGGAACAAACAAAGGAGTTTTCTAAACGGACAGAAAGTATTGCAGTCCAGGCTO 

TTGTAAAGGAAGCTTCAGAGATACCX3CTTGGGCCCCAAAATAAGCAGCT 

AATCAAAGAACAAGTCAAAAAATTAGAAGACACGCTTGAAGAAGAGTATGTGATTGACAAGTC 
TTCTTCTCTGAGATAAAGTTTC^TACAATCTTTCCTGTACCTTG 

AAAGTGTCTGTGTATTTCAGCATGTTTTGAGGAAACAACTCACAGTTCAAAAGAAAGTATCGCTAATACA 
GAAACCAATATCTATAACAGAGCCCAAAAAATATAAAGGATGTGGGTTTTGCATCTTAAACTGATCATGT 
TCATGAGAAAGCCATATCTATTCTATTCTGTGGCCT 
AATATTTAAAATAATTTTTTTACTATATTATTCT 

TAGTGTAGATTCCAGGCCTAAATACACATCACATAGACCATATATCTCCAACCTGAAGAAGCT^ 
CTTGTTTACAGTGCCTCGGTATTCAAGTTATCCTGACTAATATC 
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ATTTTATTTTTAAACTTTTAATGTTTGTTTATCTG 

Human S YNE1 mRNA sequence - varl5 {public gi: 2895592) (seq id no; 197) 

C^CCTGCATAGTAACGAAACCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCAGGCCCAGGCAT 
. TGAGCAAGGAGTTGAGGATGAAGCftGAACCTCCAGAAGT^ 

* CTGGGCCTGGCTGGGGGACACGGAGGAGGAdTTGGAACAGCTCCAGCGTCTGGAACT 
CAGACCATCGAGCTCCAGATCAAAAAGCTCAAGGAGCTCCAGAAAGCTGTGGA 

- — "TCCTCTCCATCAATCTCTGCAGC^ 

TCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGCCTGCTG 

CAGGATQCCCTGATGCftQTOCCRGQQTTTC 
TTGACAGAAGGAAAAATGAAATTGTCCCTATTGATO 

CAAAC^GCITATGCAAATAAAGCATCAGCTC^ _ 
TCTTGCCAACTACTGGTGAATGCTGAAGGAACAGACTGTTTAG 

GAAATCGGCTCAAACTTOTCTTGAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAGTTATTAGACGT 

GTCAAGTAGTCAGCAGGATTTGTCTTCCTGGTCTTCTGCTGA 

CCCACATCAGGAAGGAGCACCCCAAACAGACAGAAAACGCCACGAGGCAAGTG 

GACCCTCTGTCAGCAGTCCACATAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGAGCCAGG 

GCCAGGTCGGTCCGGCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCnTCCCCTTCAGCTTCTCCTG 

CTCCTCCTCATCGGGCTTGCCTGCCTTGTACCAATGTCAGAGGAAGACTAGAGOT 

ACTTTGCCCGGTCATCCACCCCATGCTCAGATACACGAATGGCCCTCCTCCACTCTG 

CCATCTGC^GAAGTGCTGGTAGCATAAGGAGGATCGGGTCAT^ 

CCTTGATCTTGGCGATIAGCCATCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCT^AATTAT 
GGCTTCAGAGGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCT 
GGTTTGAGACTCTGAACCTTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAA 
TTCTGGGCCCTTGGCATTCGTGTGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTCCCATGGT 

GCTGCTCC^CCATC^GATAAATGACCCTCCC 

QAGTGCTGAAGAGATTTTAGAACCTTGTAACATACAATTTTTAAGAGCTTATATGG^^ 
CCTTGTTTTCCTTTGGGGCATGATGTTTTAA 

ACTTTTTTTTAGAGGTATAAGAAAAACTAGATGTAATAAATAAGATC^TGGAAGGCTTTATO 
GTTGAATGTTATAGTAAAAAAAAAAA 

Human SYNE1 raRNA sequence - varl6 (public gi: 6330956) (seq id no: 198) 

CTCGATTTGTGCCGTCAGTCTAACAACCTGTGCTTGCAAAGGGAAG^ 
ACCATGACTGTATGAATGTTGTTGAAGTGTTCCTAGAAAAAT^ 

ATCTGATGC^GAGAGTACAGCTGTCCACCTGGAAGCTTTGAAAAAGTTAGCATTGGCATTGCAGGAGAGA 
AAGTATGCTATTGAAGATCTGAAAGATCAAAAGCAGAAAATGATA 

AGTTAGTCAAAGAACAGACGAGTCATTTAGAGCAACGTTGGTTTCAGCTTGAGGACCTCATTAAAAGGAA 
AATCCAAGTGTCAGTCACCAACTTGGAGGAGTTAAATO^ 

TGGGCAGAAGAGCAACAACCCAACATCGCCGAGGCCCTTAAGCAGAGCCCTCCTCC^GATATGGCTCAGA 
ACCTTCTCATGGATCACCTGGCCATCTGCAGTGAACTGGAGGCCAAGCAGATGCTCCTGAAATCGCTTAT 
AAAGGACGCAGAC^GGGTCATGGCAGATCTTGGTCTCAATGAGCGACAGGTCATCCAGAAGGCTCTCTCT 
GATGCACAAAGCCACGTGAATTGTCTCAGTGACTTAGTGGGC 

TGTCCGAGAAAACCCAGTTTCTCATCGCAGTGTTCCAGGCCACCAGCCAAATTCAGGAAC^ 
GATAATGTTCCGTGAACACATCTGTGTGTTACCAGATGAT^ 

GCACAAGCCAGCCTCAAGACTTACCAAAATGAAGTCAC^^ CGCGAACTAATGA 
AGGAAGTC^CAGAGCAGGAAAAGAGTGAAGTGCTGGGGAAGCTTCAGGAATTGCAGAGTGTCTATGACAG 
TGTTTTACAAAAGTGCAGTCACCGGTTACAAGAACTAGAGAAGAATTTGGTTTCTAGGAAGCATTTTAAG 
GAAGATTTTGATAAAGCTTGCGACTGGCTAAAACAAGCAGATATTGTTACATTTCCTGAAATCAACCTAA 
TGAATGAGAGTACTGAGCT^CMACACAACTGG 

TGAAAATCTTCTACTTACGCTGCAGAGAACTGGGCAGACC^TATTACCATCGGTGAATGAAGTCGATCAT 

TCCTACCTC^GTGAAAAGCTAAATGCTTTGCCTCGACAATTTAATGTAATTGTTGCCTTGGCT 

AGTTCTATAAAGTCCAGGAAGCAATTCTTGCTCGGAAGGAATATGCTTCCTTGA 

GTCTCTCAGTGAACTTGAAGCCGAATTCTTGAGGATGAGCAAA 

GCTCTTTCTCTGCAAGATGGTTGCAGAGCCATTCT^^ 

AACTGAACCAGAAAAAAGAAGGTTTTCGCAGCAC^GGTCAGCCTTGGCAGCCAGA 

TGTCACCTTATAT(^CAGGCTGAAGCGACA7VACAGAA(^GAGGGTTAGCTTATTAGAAGACACCACCAGT 
GCTTACCAAGAACACGAGAAGATGTGCCAACAGCTGGAGAGACAACTGAAGTCTO 

CCAAAGTGAATGAGGAAACGCTGCCTGCAGAGGAGAAGCTCAAAATGTATCACTCCCTGGCAGGAAGTCT 

CCAGGACTCAGGGATTGTACTGAAACGAGTAACCATACATCT 

TTGGCTTATGAGAAAGCCAGGCATCAGATCCA 

TTGGTGAGACGGTGACAGAATGTGAGAGCCGAATGGTGCAGAGTATAGACTTCCAGACTGAGATGAGTCG 
CTCCCTGGACTGGCTGAGGAGAGTGAAGGCAGAGCTCAGTGGGCCGGTGTACCTAGACCTCAACCTGCAG 
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GACATCCAMAGGAAATCJVGWUUUVTCCAAA^ 

ATGCGCTGAGTCACAAGGAAAAGGAGAAGTTCACAAAGG CCAAGGAG CTGATTTCTGCGGATTTAGAACA 
CAGCCTCGCTGAGCTCTCAGAGCTGGATOK^^ 

ACTGAAATATATAGCCAGTGTCAAAGGTATTATCAGGTATTTCAAGCAGCCAATGACTGGCTTGAGGATG 
CCCAAGAAATGTTACAGCTGGCAGGCAATGGCCTAGACGTGGAGAGCGCAGAGGAAAATCTCAAAAGCCA 
CATGGAATTTTTCAGTACAGAGGATCAGTTCCATAGTAA 

CTGGACCCACTCATCAAGCCAACCGGCAAAGAAGACCTAGAACAGAAAGTGGCTTCTCTGGAACTCAGGA 

gccagaggatgagccgggact<^gtgcccaagtggatct^ 

ttaccagaaagcaagggaagaggttattgaattgatgaatgatacagaaaagaaattgtctgagttttct 
ttgttgaagacttcgtctagtcatgaagcggaagaaaaattg 
ttaactctttccatgagaaaattgtggcccttgag 
tgccagcaaagcgaccctgagcaggtcaatgaccaccgtctggca 

gcccaggaccaggagaagatcctggaagatgcagtggatgagtggacgggctttaacaacaaggttaaaa 
aggccactgaaatgattgatcagctgcaagataagttacctggaagttcagcagagaaagcatcgaaagc 
agagctcttaactcttcttgaatacgacgacacgtt^ 
ggcatgctgcggcaggaaaccctgagcatgctcc^ 

cgctcatgcaggaaatcaccgccatgcaagatcggtgcctgaacatgcaggagaaagtgaagactaatgg 

aaagttggtgaagcaagagctgaaggaccgagaaatggtggagactcagatcaattctgtgaaatgttgg 

gttcaggaaacgaaagaatatttagggaatccaacaatagaaatagatgctcaacttgaagaacttcaga 

ttctcctaacagaagccact^aatcaccgacagaacattgaaaaaatggcagaagaacagaaggagaagta 

cttaggtctttataccatattaccttctgaactctcccttcagttggctgaagtggcgttagatctaaag 

atccgagatcagatccaagacaaaataaaagaagttgagcagagcaaggccacgagccaggaactcagcc 

ggcaaattcagaagttagctaaagacctcacaactattctaactaagctgaaagcgaagacagataatgt 

agttc^gctaaaactgaccaaaaggtgctgggagaggaattagatggctgtaattcaaagttaatggaa 

ttagatgcagcagta<^gaaattcttggaacagaatggccaactgggtaagccactggk:caagaa 

gaaaactgactgaacttcaccagcagaccattagac^^ 

agcatcacatttagaagaatacaatgaaatgcttgaattaattttgaagtggattgaaaaagctaaagtc 
'ttggctcatggaactattgcatggaattctgcaagccagcttcgggaacaatatattttgcatcaggtaa 
ccttaggaaaaataatctttaaaaagtaaccaaggggaatttgatttaactgggtagactgacacaacac 
ttagagggctgtgatgtaaaatttttggagctaccagataaaaagaatgctaaggtacccctaagttgtt 
cagtagttggacagaa^ggagcttctcatgaaatttcatgaaattgaataaataaatatccttgatcttc 

CCTAAACCTACCTTACACCAAGACCCAAACCAATCAGCCT 

AATAATTATCTGCAGGGATCTGGTGGGAAATTCTTTCCTGTGAAGAGATGCAATGAAGTGTGGAAAGATT 

CTAGACTCC^CACTCAGAOTGGTGGGAAAACCAACCTCCGCCATGC^GGCTGTGTGATTTGGAGC^G^ 

TGCTTTGCCTCTCTGAATTCTGTCTTCTCATCATTTGTATGAAGACGTAAATAATATTCGTATTTCAGAC 

TTATGAGATCAAGTGGTTTAAGGTACACACGTGCAAACGTCTGCCTGGCACATGCAGGTGCTCAGTGGGA • 

GATCTCCCGCCTCCTCCCTCAGCCCTCACCCAGGCCTGTCATCTGGCCTTCCACAGGAGGTCGGGCAGCC 

CAGAGCAAGCCATGAGTCCACATCACATGCTGGCTATGTTAGTTCATTTCCTCTGAAGTTACATGAGAAA 

AATGTTCCTTTTCTGTCAGTCACGTC^TCCAGGAAATTATTTCATCCTTTTGTAACTTAAGCTTAAATTA 

GACACAGATAGTTAATAGGCTAGTTATCATATAATAAAATATAGGGTGACTTTTATAGGAGTTACATGGG 

TATCGAGTATTCTAGATTTTTGTCTCTTATATTATTTATGTATCCTTGTGGCCTTTAAATGAATCCCTGT 

TTCCATTCT^GTTTACAGGGTTCTAGATCAAAGCCTCATTTTTCCATTTTT^ 

TAATTTTTCCCTATATTCACAGTCTTTCTC 

TAAGCAACTCCAATCTTTTGCTTTAAGATATGCTTAGATATGAACAGACAGGACTTAAGTTACCACTGAT 
TTGAAAACAATGAAAAAAAGCCAAGATCCTTAGAAGTCT 

AAGAAAGACAAACTTAACTGTCACATTCATACTGTTTCTTTCAAGTTCATATTTAAGGAAGTGAGAGCTC 
TCAAACATTGCTGGTATCCTGGTAAAATCTCTTTGAAAAATAATTGGCA7VAATGTATGGTGATTGTCAAA 
AATGTTGCTACTCTGGGCCACXjTGCGGTGGCTCACACCTC 

GTGGATCACAGGGTCAGGAGTTCGGGACCGGCTTGGCCGGTATGGTGAAGCCCCATCTCTACTAGGAGTG 
CAAAAGTCAGCTGGGCGTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAATCTGAGGCGGGAGAATC 
GCTTGAACTCGGGAGGTGGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTGGGTG 
GAGACTCCATCTC 

Hitman SYNE 1 mRNA sequence- vari7 (public gi: 2052166D (seq id no: 199) 

GTTGGATTTCTCTAATGGAAAATGTTATTCAGAAGGATGAAGATAATATTAAAAATT C CATAGGTTACAA 

GGCAATTCATGAATACCTTCAGAAATATAAGGGTTTTAAGATAGACATTAACTGTAAAm 

GATTTTGTGAACCAGTCCGTGCTACAAATCAGCAGTCAGGATGTGGAAAGTAAGCGTAGTGATAAGACTG 

>"ATTTTGCTGAGCAACTTGGAGCAATGAATAAAAGTTGGCAAATTCTG^ 
CCAGCTGTTGGAAGGCTTATTGGAATCTTGGTCAGAATATGAAAATAATGTACAATGTCIXSAAAA 

TTTGAAACCCAGGAAAAGAGACTAAAACAACAGCATCGAATTGGAGATCAGGCT 

TGAAAGACTGTCAGGATCTGGAAGATTTGATTAAAGCAAAAGAAAAAGAAGTAGAGAAAATTGAGCAGAA 
TGGACTTGCTTTGATTCAGAAC^GAAAGAAGACGTCTCTAGCATTGTCATGAGCACACTGCGAGAGCTC 
GGCCAAACCTGGGCAAATTTAGATCACATGGTTGGAC^TTAAAGATACTGCTGAAATCAGTGCTTGACC 
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AATGGAGTAGTCACAAAGTGGCCTTTGACAAGAT^ 

CCGATTCCGTCTGCTGACTGGCTCCTTAGAAGCTGTGC^GTTCAGGTGGACAATCTTCAGAATCTCCAA 
GATGATCTGGAAAAACAGGAAAGGAGCTTACAGAAATTTGGCT 

GTCACCCACCCGTGACAGAAACTCTTACCAATACACTGAAAGAAGTCAACATGAGATGGAATAACTTGCT 
GGAAGAGATTGCTGAGCAGCTAC^GTCCAGCAAG^ 

TCCAAACAGTGTGCTTCGACAGTTCAGCAGCAGGAGGATCGAACCAATGAGCTGTTGAAGGCAGCCACAA 

ACy^AGGACATTGCCGATGATGAGGTTGCCACATGGATTCAAGATTGCAACGACCTCCTCAAAGGA 

GACAGTTAAAGATTCCCTCTTTGTOCTCCATG 

GCAGCATCAGCTATTCAATCGGATCAACTCTCT^ 

GCAAACAGCAGACTTCATTACAGGCTGGAGTTC^ 

GGAGGCCTGGATAGTGGAAGCTGAAGAAATACTACAAGGGCAGGACCCTAGCCACTCATCTGACCTCTCC 

ACAATCCAGGAAAGGATGGAAGAACTTAAGGGACAGATGTTAAAATTCAGCAGCATGGCTCCAGATTTAG 

ACCGTCTAAATGAGCTTGGATATAGGTTACCCTTGAATGATAAGGAAATCAAAAGAATGCAGAATCTGAA 

CCGCCATTGGTCTCTGATCTCCTCTCAGACTACAGAAAGATTCAGCAAGTTGCAGTCATTTTTGCTACAA 

CATCAGACTTTCTTGGAAAAATGTGAAACATGGATGGAATTCCTAGTTCAGACAGAACAAAAGT 

TAGAGATTTCAGGAAATTATCAGCACCTTTTGGAAC^GCAGAGAGCACACGAGTTGTTT 

GTTCAGTCGTCAGCAGATTTTGCACTCAATCATTATTGATGGGCAACGTCTTCTAGAACAAGGTCAAGTT 

GATGACAGGGATGAATTCAACCTGAAATTGAC^CTC^ 

CCCAGCAGAGGCGGGGGATCATTGACAGCCAGATTCGCCAGTGGCAGCGCTATAGGGAGATGGCAGAT^AA 
GCTTCGTAAATGGTTGGTTGAAGTGTCCTACCTCCCCATGAGTGGTCTCGGAAGTGTTCCTATACCACTG 
CAACAAGCAAGGACCCTCTTTGATGAAGTGCAGTTCAAAGAAAAAGTGTTTCTGCGGCAACAAGGCAGCT 
ACATCCTGACTGTGGAGGCTGGC^GC^CTCCTTCTCTCGGCGGACAGTGGCGCTGAGGCCGCCTTG^ 
GGCCGAACTCGCTGAAATCCT^GAGAAATGGAAATCAGCCAGCATGCGGCTGGAAGAACAGAAGAAAAAA 
OTAGCCTTCTTGTTGAAAGACTGGGAAAAATGTGAGAAAGGAATAGCAGATTCCCTGGAGAAACTACGAA 
CTTTCAAAAAGAAGCTTTCGCAGTCTCTCCCGGATCACCAT^ 

CAAGGAATTAGAAAATGCAGTTGGGAGCTGGAC7VGATGACTTGACCCAGTTGAGCCTGCTGAAGGACACC 
CTCTCTGCCTATATCAGTGCTGATGATATCTCCATTCTTAATGAACGCGTAGAGCTTCTGCAAAGGCAGT 
GGGAAGAACTATGCCACCAGCTCTCCTTAAGGCGGCAGCAAATAGGTGAAAGATTGAATGAATGGGCAGT 
CTT(^GTGAAAAGAAC^GGAACTCTGTGAGTGGTTGACTCAAATGGAAAGCAAAGTTTCT(^GAATGGA 
GACATTCTCATTGAAGAAATGATAGAGAAGCTCAAGAAGGATTATCAAGAGGAAATTGCTATTGCTCAAG 
AGAACAAAATACAGCTCCAACAAATGGGAGAACGACTTGCTAAAGCCAGCCATGAAAGCAAAGCATCTGA 
GATTGAATACAAGCTGGGAAAGGTC^^CGACCGGTGGCAGCATCTCCTGGACCTCATTGCAGCCAGGGTG 
AAGAAGCTGAAGGAGACCCTGGTAGCCGTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGC 
TCGCTCACATCGAGTCAGAGCTGGCCAAGCCAATAGTCT^ 

AAAGCTTAATGAGCAGCAGGAGCTTCAGAGAGACATAGAGAAGCACAGTACAGGTGT^ 
AACCTGTGTGAAGTCCTGCTGCACGACTGTGACGCCTGTGCCACTGATGCCGAGTGTGACTCTATACAGC 
AGGCTACGAGAAACCTGGACCGGCGGTGGAGAAACATTTGTGCTATGTCCATGGAAAGGAGGCTGAAAAT 
CGAAGAGACGTGGCGATTGTGGCAGAAATTTCTGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCT 
TCAGAAAGGAC^GCTGCTTTTCCCAGCTCTTCTGGGGTGATCTATACAGTTGCCAAGGAAGAAOT 
• AATTTGAGGCTTTCCAGCGACAGGTCCACGAGTGCCTGACGCAGCTGGAACTGATCAACAAGCAGTACCG 
CCGCCTGGCCAGGGAGAACCGCACTGATTCAGCATGTAGCCTCAAACAGATGGTTCACGAAGGCAACCAG 
AGATGGGACAACCTGCAAAAGCGTGTCACCTCGATCT^ 

AGGAGTTTGAGACTGCGCGGGACAGCATTCTGGTCTGGCTC^CAGAGATGGATCTGCAGCTCACTAATAT 
TGAACATTTTTCTGAGTGTGATGTTCAAGCTAAAATAAAGCAACTCAAG 

CTGAACCACAATAAGATTGAGCAGATAATTGCCCAAGGAGAACAGCTGATAGAAAAGAGTGAGCCCTTGG 

ATGCAGCGATCATCGAGGAGGAACTAGATGAGCTCCGACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGA 

AAGATACCATAAGAAACTGATCCGCCTGCCTCTCCCAGACGATGAGCACGACCTCTCAGACAGGGAGCTG 

GAGCTGGAAGACTCTGCAGCTCTGTCGGACCTGCACTGGCACGACCGCTCTGCAGACAGCCTGCTTTCTC 

CACAGCCnTCCTCCAATCTCTCCCTCTCGCTCGCTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACAC 

CCCAGCTAGTGTGGACTCCATCCCCCTGGAGTGGGATCACGACTATGACCTCAGTCGGGACCTGGAGTCT 

GCAATGTCCAGAGCTCTGCCCTCTGAGGATGAAGAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGAG 

CTGTTGCCTTATGAGGGGACCACAGTGCCCTAGAGTCACAGATCCGACAACTGGG 

TAGCCGCTTTCAGATAGAGCAAACCGAAAATAT 

ACCAGCTACAAAGGCTACATGAAACTGCTGGGCGAATGCAGTAGGAGTATAGACTCCGTGAAGAGACTGG 

AGCACAAACTGAAGGAGGAAGAGGAGAGCCTTCCTGGCTTTGTTAACCTGGATAGTACCGAT^ACCCAAAC 

GGCTGGTGTGATTGACCGATGGGAGCTTCTC(^GGCCCAGGCATTGAGCAAGGAGTTGAGGATGAAGCAG 

AACCTCCAGAAGTGGCAGCAGTTTAACTCAGACT^ 

AGGAGTTGGAACAGCTCCAGCGTCTGGAACTCAGC^^ 

GCTCAAGGAGCTCCAGAAAGCTGTGGACCACCGCAAAGCCATCATCCTCTCGATCAAT 

GAGTTCACCCAGGCTGACAGCAAGGAGAGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCT 

GGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGG 

TTTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGGAGAACATTGACAGAAGGAAAAATGAAATTGTC 

CCTATTGATTCTAACCTTGATGCAGAGATACTTCAGGAC^ 
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AGCTGTTGGAATCCC^CTCAGAGTAGCCT^ 

AGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTCCATGTTATTGGAAAT^ 
GAGGTCAGTCX3TCATATCAAGGAACTGGAGAAGTTATTAGACGTGTCAAGTAGTCAGC 

CCTGGTCTTCTGCTGATGAACTGGACACCTCAG 
CAGACAGAAAACGCCACGAGGCAAGTGTAGTCTCTCACAGCCTC 
"AGGTCCACAAAAGGTGGCTCCGATTCCTCCCTT^ 

TQTTCAGAGTCCTCCGAGCAGCTCTO^ 
^^GTACCAATGTCAGAGGAAGACTAC^GCTGTGCCCTCTCCAACAACTTTGCC 

CTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAAGCAGATGCCATCTGCAGAAGTGCTGGTAGCAT 

AAGGAC^ATCGGGTCATAAGCAATCCCAAACTACCAA^^ 

TGTGGCAGCTTTAGCCCTCCTCCAGATC^CATGTGTGCAAATTATGGCTTCAGAGGTGGAAGATAAACAG 
TGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCTGGTT^ 

AAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAATTCTGGGCCCTTGGCCCATTCTGTGC 
ACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTCCCATGGTGCTGCTCCAACCATCAGATAAATGA 

CCCTCCCAAGCACCATGTCAGTGTCX3TACAATCTA 

GTAACATACAATTTTTAAGAGCTTATATGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGGCATGATGT 
TTTAACCTTTGCTTTAGAAGCAC^VAGCTGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAA 
CTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNEJ mRNA sequence - varl8 (public gi: 28195688) (seq id no 

TGTTCTCACGAGGGGGCCAGCTTGGGGCTTGACTGAGCAGG 

ATGTGACCTCTGCACATTGTTTACAGTTCCTAAACTGTGATTTCTTTTTCTGTGAAATAGTTATAATAG^ 

AAGTGGCTACCAAGTAGAAAGTGGTCATGGGGGGTGAAGGTTAAACACAATAACGGACACACAGAACTTA 

CACAGGGCATTTTATGCCAAGCTATATTGAATATOTATATCCCTCTACCTGCCCGTCAATGTCATGAATA 

TTGACAATTCACTCTAGACCCTGCTAGAAGAATCCAAAGAAATTGACAGTGAGCTGGAAGCAATGACTGA 

GAAATTACAGTACCTCACTAGCGTGTACTGTACAGAAAAAATGTCTCAGCAAGTGGCAGAAOT 

GAGACTGAGGAGTTGCGACAGATGATCAAAATTCGTTTGCAGAACCTCCAAGATGCAGCTAAGGATATGA 

kAAAATTTGAAGCAGAGTTGAAAAAGTTACAAGCTGCCTT^ 

AGAAGTTGGACGTCTCAGTCTCAAGGAGCAGCTCTCTC^^ 

CTGAAGCCGAAGGTGCAAGCAGTGCAGCTCTGCCAGAGTGCCCTCCGGATCCCCGAGGATGTGGTTGCCA 
GCTTACCTCTCTGTCATGCTGCTC^ 

GCAGTGTAAC^TCATGCAGGAAGCTGTGGTACAATATGAACAATATGAGCAAGAAATGAAA 
CAACTGATAGAAGGAGCTCACAGAGAGATTGAGGATAAACCTC 

AGGCTCAGATTTCTCGGCATGAGGAGCTGGCGCAGAAAATTAAGGGCTACCAGGAGCAGATCGCTTCTTT 
GAATTCCAAGTGC^^GATGCTGACGATGAAAGCCT^GCACGCCACCATGCTGCTGACGGTGACCGAGGTC 
GAGGGGCTGGCX3GAAGGGACAGAGGACCTGGATGGGGAGCT 

TGGTCATGATGACTGCAGGTCGCTGTCACACTTTGCTGTCACCGGTCACTGAGGAGTCTGGGGAGGAGGG 

AACCAACAGTGAGATTTCCTCTCCACCTGCCTGTCGCTCCCCTTCACCTGTGGCTAATACAGATGCTTCT 

GTTAACC^GGACATTGCATATTACCAAGCCTTGTCTGCTGAGAGGTTGCAGACAGATGCTGC?yVAAATTC 

ACCCC7VGCACATCCGCATCCCAGGAGTTCTATGAACCGGGATTGGAGCCATCCGCTACTGCCAAACTGGG 

TGATTTGCAGCGTTCTTGGGAAACCTTAAAGAATGTGATCAGTGAGAAGCAGCGC^CACTCTATGAAGCT 

TTGGAGCGCCAGCAGAAGTACCAGGACTCCCTCCAGTCCATCTCTACGAAGATGGAGGCCATTGAGCTGA 

AACTCAGTGAGAGCCCAGAGCCTGGGAGGAGTCCAGAAAGCCAGATGGCTGAACATGA 

TGAGATTCTGATGCTCCAGGATGAAATCAATGAGCTCC^ 

TCTTGTGAGGCCGACCCTGCGGAGCAGCTGGCCTTGCAGTCCACGCTCACTGTCTTAGCCGAGCGAATGT 
CCACCATCAGGATGAAAGCCTCGGGGAAACGGCAGCTTTTGGAGGAGAAGTTGAATGATCAGCTGGAGGA 
ACAAAGGCAGGAACAGGCCCTGCAGAGGTATCGCTGTGAAGCCGATGAGCTGGACAGCTGGCTCTTGAGT 

ACCAAGGCCACTCTGGACACTGCGCTGAGTCGA^ 

ACTGCCAGAATATGCTGGTGGAAATAGAGCAGAAGGTGGTGGCTTTATCAGAACTGTCAGTCCACAATGA 

G7VACCTGCTGCTGGAGGGCAAAGCTCACACCAAGGACGAGGCCGAGCAGCTGGCTGGAAAGCTGAGAAGG 

CTCAAGGGGAGCCTGCTGGAGCTGCAGAGAGCCCTGCATGATAAGCAGCTCAACATGCAGGG^ 

AGGAGAAGGAGGAGAGCGATGTTGACCTAAC7VGCCATC<^^ 

AGCTCGCACCACATGGACCCAGCAGCX^CAGAGCA 

GCCGAGCAGAAGAGTCTCCTTCGCTCAGTAGCCAGTCGTGGAGAGGAGATTCTAATTCAACATTCGGCGG 
CAGAGACOTCTGGTGATGCTGGCGAAAAACCTGATGTGTTATC C CAGGAGTTGGGGATGGAAGGGGAGAA 
ATCATCCGCTGAAGACCAGATGAGAATGAAATGGGAAAGCCTACATC^GAATTTAGTACCAAG(^GA^ 
CTACTACAGAATGTTCTGGAACAGGAACAAGAG CAAGTGCTTTATAGCAGGC CAAATCGACTCTTGTCTG 
GTGTGCCACTGTACAAAGGGGACGTGCCAACCCAAGATAAATCTGCAGTTACATCTTTGCTGGATGGACT 
GAACCAAGCCTTCGAGGAGGTTTCATCCCAGAGTGGAGGGGCA 

AAGTTGTATGATGGAGTCTCAGCCACCTCTACTTGGTTGGATGACGTTGAAGAACGTTTATTTGTTGCCA 
CAGCACTTTTACCAGAAGAAACAGAGACTTGTCTCTTCAACCAAGAGATTCTTGC^ 
AATGTCTGAAGAAATGGATAAGAACAAAAACTTGTTTTCCCAAG 
GATGTTATTGAAGATACTTTGGGTTGTCTTTTG^ 
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GATGTCATCAGATGAAAQAAAGACTTCAGCAAATACTAAATTTC 
ATCACTGGCT6ACAACAAATACATCATTCTGCAAAAACT 

CAAATAGAGGCAATACAACAGGCTGAAGATGGACTCAAAGAATTTGATGCAGGAATCATTGAATTAAAGA 
GGCGTGGTGACGAGCTACAGGTCGAGCAGCCGTCCATGCAAGAACTCTCCAAGCTCCAGGACATGTATGA 
TGAGCTGATGATGATCATTGGCTCCCGGAGGAGTGGTCTGAATCAGAACCTTACACTCAAGAGTCAGTAT 
GAGAGGGCCCTACAAGATCTGGCTGACCTGCTAGAAACTGGTCAGGAGAAGATGGCAGGAGACCAGAAAA 
TCATCGTGTCTTCCAAAGAGGAAATCCAGGAACCACTT 

ATCTCATATGATCTTGACTGTAACACTCTTC^GAAAGATAATCAGCTTTGCAGTCCAAAAGGAAACCCAG 
TTCCATACAGAGCTGATGGCTCAGGCTTCTGCTGTACTGA^ 

AGTACATTCTAGAGACGTGGTCCCATCTGGATGAGGACC^GCAGGAGCTCAGCAGAC^GCTGGAGGTGG 
GGAAAGCAGCATCCCAAGCGTGGGTCTGGTGGAGGAGAACX3AGGAC^^ 

TACCAGCATTTAAAATCTAGCCTTAATGAATACCAGCCCAAATTATATCAAGTATTAGATGATGGGAAAC 
GACTTCTGATATCCATCAGCTGCTCAGATCTAGAAAGCCAACTAAATCAACTTGGAGAGTGCTGGCTAAG 
TAACACCAATAAAATGTCTAAGGAACTTCACAGACT 

AGTGAATCTGCAGATCTAATTCACTGGTTAC^TCTGCAAAAGACCGGCrAGAATTTTGGACTCAGCAAT 
CTGTGACAGTCCCACAAGAGCTGGAAATGGTCCGTGATCATCTAAATGCTTTCCTGGAGTTTTCTAAAGA 
AGTGGATGCCCAATCTTCCCTGAAATCATCTGTTCTGAGTAC^ 

GTGGACACAGCCACGCTGCGCTCTGAGTTGTCGCGCATTGATAGCCAGTGGACTGACCTGCTAACCAATA 
TCCCAGCCGTCCAGGAGAAGCTCCACGAGCTCCAGATGGAT^ 

AGTCATGAGTTGGACTTCTCTAATGGAAAATGCTATTCAGAAGGATGAAGATAATATTAAAAATTCCATA 

GGTTACAAGGCAATTCATGAATACCTTCAGAAATATAAGGGTTTTAAGATAGACATTAACTGTAAACAGC 

TGACAGTGGATTTTGTGAACCAGTCCGTGCTACAAATCAGCAGTCAGGATGTGGAAAGTAAGCGTAG 

TAAGACTGATTTTGCTGAGCAACTTGGAGCAATGAATAAAAGTTGGCAAATTCTGGAAGGTCTAGT 

GAGAAGATCGAGCTGTTGGAAGGCTTATTGGAATCTTGGTCAGA 

AAACATGGTTTGAAACCCAGGAAAAGAGACTAAAACAACAGCATCGAATTGGAGATCAGGCT^ 

AAATGCACTGAAAGACTGTCAGGATCTGGAAGATCTGATTAAAGCAAAAGATAAAGAAGTAGAGAAAATT 

GAGCAGAATGGACTTGCTTTGATTCAGACCAAGAAAGAAGACGTCTCTAGCATTGTCATGAGCACACTGC 

GAGAGCTCGGCCAAACCTGGGCAAATTTAGATCACATGGTTGGACAATTAAAGATACTGCTGAAATCAGT 

GCTTGACCAATGGAGTAGTCACAAAGTGGCCTTTGACAA 

TCTCTTTCCCGATTCCGTCS^ 

ATCTCCAAGATGATCTGGAAAAACAGGAAAGGAGCTTAC^GAAATTTGGCTCTATCACCAACCAATTATT 

AAAAGAGTGTC^CCCACCCGTGACAGAAACTCTTACCAATACACT 

AACTTGCTGGAAGAGATTGCTGAGGAGCTACAGTCCAGCAAGGC^ 

AGGACTACTCCAAACAGTGTGCTTCGACAGTTCAGCAGCAGGAGGATCGAACCAATGAGCTGTTG7VA 

AGC^CAAACAAGGACATTGCCGATGATGAGGTTGCCACATGGATTCAAGATTGCAACGACCTCCTCAAA 

GGACTGGGCACAGTTAAAGATTCCCTCTTTGTTCTCCATGAGCTGGGAGAGCAACTGAAGGAACAAGTGG 

ATGCTTCCGCAGCATCAGCTATTCAATCGGATCAACT^ 

AGCTCTCTGCAAAGAGCAGACTTCATTACAGGCTGGAGTT 

GAAGCTTTGGAGGCCTGGATAGTGGAAGCTGAAGAAATACTACAAGGGCAGGACCCTAGCCACTCATCTG 
ACCTCTCCACAATCCAGGAAAGGATGGAAGAACTTT^GGGACAGATGTTAAAATTCAGCAGCATGGCTCC 
AGATTTAGACCGTCTAAATGAGOTTGGATATAGGT^^ 
AATCTGAACCGCCATTGGTCTCTGATCTCCTCTCAGAC 

TGCTACAACATCAGACTTTCTTGGAAAAATGTGAAACATGGATGGAATTCCTAGTTCAGA 

GTTAGCAGTAGAGATTTCAGGAAATTATCAGCACCTTTTG 

GCCGAGATGTTCAGTCGTCAGCAGATTTTGCACTCAATCATO 

GTCAAGTTGATGACAGGGATGAATTCAACCTGAAATTGACACTCCTGAGTAATCAATGG 

TCGCAGGGCCCAGCAGAGGCGGGGGATCATTGAC^GCCAGATTCGCCAGTGGC^GCGCTATAGGGAGATG 

GCAGAAAAGCTTCGTAAATGGTTGGTTGAAGTGTCCTACCTCCCCATGAGTGGTCTCGGAAGTGTTCCTA 

TACCACTGCAACAAGOVAGGACCCTCTTT^ 

AGGCAGCTACATCCTGACTGTGGAGGCTGGCAAGCAACTCCT^ 

GCCTTGCAGGCCGAACTCGCTGAAATCCAAGAGAAATGGAAATCAGC 

AGAAAAAACTAGCOTTCTTGTTGAAAGACTGGGAAAAATGTGAGAAAGGAATAGC^GATTCCCTGGAGAA 

ACTACGAACTTTCAAAAAGAAGCTTTCGCAGTCTCT 

ATGCGTTGCAAGGAATTAGAAAATGCAGTTGGGAGCTGGACAGATGA 

AGGAC^CCCTCTCTGCCTATATCAGTGCTGATGATATCTCCATTCTTAATGAACGCGTAGAGCTTCTG^ 

AAGGCAGTGGGAAGAACTATGCCACCAGCTCTCCTTAAGGCGGCAGCAAATAGGTGAA^ 

TGGGCAGTCTTCAGTGAAAAGAACAAGGAACTCTGTGA 

AGAATGGAGACATTCTCATTGAAGAAATGATAGAGAAGCTCAAGAAGGATTATCAAGAGGA 

TGCTCAAGAGAAGAAAATACAGCTCCAACAAATGGGAGAAC^ 

GCATCTGAGATTGAATAGAAGCTGGGAAAGGTCAACGACCGG 

CCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCCGTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAG 
GACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCAAGCCAAT^ 

ATACAGAGAAAG CTT AATGAGCAGCAG GAGCTTCAGAGAGACATAGAGAAG CACAGTACAGGTGTTGCAT 
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CTGTCCTCAACCTGTGTGAAGTCCTGCTGCACGACTGTGACGC 

TATACAGCAGGCTACGAGAAACCTGGACCGGCGGTGGAGAAACATTTGTGCTATGTCCATGGAAAGGAGG 
CTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAAATTTCTGGATGACTATTCACGTTTTGAAGATTGGC 

TGAAGTCTTCAGAAAGGACAGCTGCTTTTC^ 

ACTAAAGAAATTTGAGGCTTTCCAGCGACAGGTCCACGAGTGCCTGACGCA^ 

CAGTACCGCCGCCTGGCCAGGG^GAACCGCACTGATTCAGCATGT^ 

GCAACCAGAGATGGGACAACCTGCAAAAGCGTGTCACCTCCATCOT 

CCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCATTCTGGTCTGGCTCACAGAGATGGATCTGCAGCTC 

ACTAATATTGAACATTTTTCTGAGTGTGATGTTCAAGCTAAAATAAAGCAACTCAAGGCCT 

AAATTTCACTGAACCACAATAAGATTGAGCAGATAATTGCCCAA 

GCCCTTGGATGCAGCGATCATCGAGGAGGAACTAGATGAGCTCCGACGGTACTGCCAGGAGGTCTTCGGG 
CGTGTGGAAAGATACCATAAGAAACTGATCCGCCTGCCTCTCCCAGACGATGAGCACGACCTCTCAGACA 
GGGAGCTGGAGCTGGAAGACTCTGCAGCTCTGTCGGACCTGCACrrGGCACGACCGCTCTGCAGACAGCCT 
GCTTTCTCCACA.GCCTTCCTCCAATCTCTCCCTCTCGCTCGCTCAGCCCCTCCGGAGCGAGCGGTCAGGA 
CGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGTGGGATCACGACTATGACCTCAGTCGGGACC 
TGGAGXCTGCAATGTCCAGAGCTCTGCCCTCTGAGGATGAAGAAGGTCAGGATGACAAAGATTTCTACCT 
CCGGGGAGCTGTTGCCTTATCAGGGGACCA.CAGTGCCCTAGAGTCACAGATCCGACAACTGGGCAAAGCC 
CTGGATGATAGCCGCTTTCAGATACAGCAAACCGAAAATATCAT^ 

AGCTAGACACCAGCTACAAAGGCTACATGAAACTGCTGGGCGAATGCAGTAGCAGTATAGACTCCGTGAA 
GAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGAGCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAA 
ACCCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCAGGCCCAGGCATTGAGCAAGGAGTTGAGGA 
TGAAGCAGAACCTCC^GAAGTGGCAGCAGTTTAACTCAGACTTGAACAGCATCTGGGCCTGGCTGGGGGA 
CACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGGAACTCAGCACTGACATCCAGACCATCGAGCTCCAG 
ATCAAAAAGCTCAAGGAGCTCCAGAAAGCTGTGGACCACCGCAAAGCCATCATCCTCTCCATCAATCTCT 
GCAGCCCTGAGTTCACCCAGGCTGACAGCAAGGAGAGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAA 
TGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAG 
TGCC^GGGTTTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGGAGAACATTGACAGAAGGAAAAATG 
XAA/ITGTCCCTATTGATTCTAACCTTGATC 

AAAGCATGAGCTGTTGGAATCCCAACTCAGAGTAGCCTCTTTGC^^ 

AATGCTGAAGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTCCATGTTATTGGAAATCGGCTCAAACTTC 
TCTTGAAGGAGGTCAGTCGTCATATGAAGGAACTGGAGAAGT^ 
TTTGTCTTCCTXK3TCTTCTGCTGATGAACTGGACACCTCAGG 
ACCCCAAACAGACAGAAAACGCC^CGAGGCAAGTGTAGTCTC 

CACATAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGAGCCAGGGCCAGGTCGGTCCGGCCG 

CGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTTCrCCTGCTCC^ 

GCOTGCCTTGTACCAATGTCAGAGGAAGACTACAGCTGTGCCCTCTCCAACAACTTTGCCCGGTCATTCC 

ACCCCATGCTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAAGCAGATGCCATCTGCAGAAGTGCT 

GGTAGCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTACCAACAAGAGGACCTTGATCTTGGCGAAA 

GCCCTCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAATTATGGCTTCAGAGGTGGAAG 

ATAAACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCTGGTTTGAGACTCTGA 

TTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAATTCTGGGC 

TTCTGTGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTCCCATGGTGCTGCTCCAACCATCAG 

ATAAATGACCCTCCCAAGCACCATGTCAGTGTCGTACAATCTACCAACCAACCAGTGCTGAAGAGATTTT 

AGAACCTTGTAACATACAATTTTTAAGAGCT^^ 

CATGATGTTTTAACCTTTGCTTTAGAAGCACAAGCTGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATA 
AAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE 1 mRNA sequence - varl9 (public gi: 28195676) (seq id no; 201) 

CAAGGGGAAACTTTCATCCCCACGCAGGTTATAGCTTTTTGCTCTGCAGAGTCT 

AGCTTCATGGTGGTGGCGGAGGACCTGAGTGCCCTGAGGATGGCAGAGGACGGCTGTO 

TCCCAGATTGTAACTGCGATGTCACAAGGGCCAGGGTGAAGA^^ 

GCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCAAGCC^ 
ATAGTCTACGATTCCTGTAACTOKSAAGAAATACAGAGAAAGCrT 

ACATAGAGAAGCACAGTACAGGTGTTGGATCTGTCCTCAACCTGTGTGAAGTCCTGCTGCACGACTGTGA 

CGCCTGTGCCACTGATGCCGAGTGTGACTCTATACAGCAGGCTACGAGAAACCTGGACCGGCGGTGGAGA 

AACATTTGTGCTATGTCCATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTGTGGC^GAAATTTC 

TGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCTTCAGAAAGGACAGCTGCTTTTCCCAGCTCTTC 

TGGGGTGATCTATACAGTTGCCAAGGAAGAACTAAAGAAATTTGAGGCTTTCCAGCGACAGGTCCACGAG 

TGCCTGACGCAGCTGGAACTGATCAACAAGCAGTACCGCC^ 

CATGTAGCCTCAAACAGATGGTTCACGAAGGCAACCAGAGATC 

CATCTTGCGCAGACTCAAGCATTTTATTGGCC^ 

GTCTGGCTCACAGAGATGGATCTGCAGCTCACTAATATTGAA 

AAATAAAGCAACTCAAGGCCTTCXIAGCAGGAAATTTCACTGAACGACA^ 
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CCAAGGAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCGATCATCGAGGAGGAACTAGATGAG 
CTCCGACGGTACTX3CCAGGAGGTCTTCGGGCGTGTGGAAAGATACCATAAGAAACTGATCCGCCTGCCTC 
TCCCAGACXJATGAG^CGACCTCTC^GACAGGGAGCTGGAGCTGGAAGACTCTGCAGCTCTGTCGGACCT 

GCACTGGCACGACCGCTCTGCAGACAGCCTC 

GCTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGT 
^GGGATCACGACTATGACCTCAGTCGGGACCTGGAGTOTGCAATGTCCAGAGCTCTGCCCTOT 

AGAAGGTCAGG&TGACAAAGATTTCTACCT 
-•^^GCCCTGAGGCCTATGTAAAACTCACAGAAAATGCT^ 

AGTCACAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCGCTTTCAGATACAGCAAACCGAAAATAT 

CATTCGCAGCAAAACTCCC^CGGGGCCGGAGCTAGACACCAGCTACAAAGGCTACATGAAACT 
GAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGAGCCTTC 
CTGGCTTTGTTAACCTGCATAGTACCGAAACCCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCA 
GGCCCAGGGATTGAGCAAGGAGTTGAGGATGAAGCAGAACCTCCAGAAGTGG CAG CAGTTTAACTCAGAC — 
TTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGGAACTCA 

GCACTGAC^TCC^GACCATCGAGCTCCAGATC^^ 

CAAAGCCATCATCCTCTCCATCAATCTCTGCAGCCCTGAGTTCACCCAGGCTGACAGCAAGGAGAGCCGG 

GACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGC 

GGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAATGAGCCATGGTTTGCTTCTTAT 

GCTGGAGAACATTGACAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAGATACTT 

CAGGACGATCACAAAGAGCTTATGCAAATAAAGCATGAGCTG 

TGCAAGACATGTCTTGCCAACTACTGGTGAATGCT 

CCATGTTATTGGAAATCGGCTCAAACTTCTCTTGAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAG 
TTATTAGACGTGTCAAGTAGTCAGCAGGATTTGTCTTCCTGGTCTTCTGCTGATGAACTGGACACCTCAG 
GGTCTGTGAGTCCCACATCAGGAAGGAGCACCCCAAACAGACAGAAAACGCCACGAGGCAAGTGTAGTCT 
CTCACAGCCTGGACCCTCTGTCAGCAGTCCACATAGCAG 

TCTGAGCC^GGGCCAGGTCGGTCCGGCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTC 
AGCTTCTCCTGCTCCTCCTCATCGGGCTTGCCTGCCTTGTACCAATGTCAGAGGAAGACTACAGCTGTGC 
CCTCTCCAACAACTTTGCCCGGTCATTCCACCCC^ 

ACTAAGCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTA 
CCAACAAGAGGACCTTGATCTTGGCGAAAGCCCTCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATG 
TGTGCAAATTATGGCTTCAGAGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTT 
GGAAGAAATCTGGTTTGAGACTCTGAACCTTAGCACT^ 

GGACTCATGAATTCTGGGCCCTTGGCCGATTCTGTGCACAGCCAAGGACTTCAGTAGACCA 
CTTTCCCATGGTGCTGCTCCAACC^ 

TACCAACCAACCAGTGCTGAAGAGATTTTAGAACCTTGTAAC 
CTTCCTTTTTACCTTGTTTTCCTTTGGGGCATC 
TCTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAA 
ATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE1 Protein sequence - varl (public gi: 21753085) (seq id no: 295) 

MVVDDLFEDMKDGVKIjIjALiLEVLSGQK^ 

RPSIVLGLmTIILYFQIEELTSNLPQLQSLSSSASSVDSIVSSETPSPPSKRKVTTKIQGNAKKAIJJ^ 
VQ YTAGKQTGI EVKDFGKSWRSGVAFHS VIHAI RPELVDLETVKGRSNRENLEDAFTI AETELGI PRLLD 
PEDVDVDKPDEKSIJVrTWAQFLKHYPDIHNASTDGQEDDE 

EQFERDLTRAQMVESNLQDKYQSFKHFRVQYEMKRKQ I EHLI QPLHRDGKLS LDQALVKQSWDRVTS RLF 
DWHIQLDKSLPAPLGT IGAWL YRAEVALREE I TVQQVHEETANTIQRKLEQHKDLIjQNTDAHKRAFHEI Y 
RTRSVNGIPVPPDQLEDMAERFHFVSSTSEIiHLMKMEFLELKYRLLSL^^ 
QLIiQNYVSFIENSKFFEQYEVTYQILKQTAE^^ 

liEEVISNWDRYGNWASLQAWLEDAEKMLNQSENAKKDFFRNLPHWIQQH^ 
DLKQQLLLLNGRWRELFMEVKQYAQADEI^RMK^ 
DLEDIEQRVPVMDAQYKI ITKTAHLITKESP 

Human SYNE1 Protein sequence - var2 (public gi: 19584385) (seq id no: 296) 

KXiLIQDLEDI EQRVPVMDAQ YKI I TKTAHLI TKES PQEEGKEMFATMS KLKEQLTKVKECYS PIjIiYESQQ 
LLI PLEELEKQMTSFYDSLGKINEI ITVLEREAQSSALFKQKHQELLACQEKTCKKTLTLIEKGSQSVQKF 
VTLSNVIiKHFDQTRLQRQIADIHVAFQSMVKKTGDWKKHVETNS ' 
KGDPEELLRRHTEFFSQLDQRVLNAFLKACDELTDILPEQEQQGIjQEAVRKIjHKQW 

KIDVEKNRFLASVEECRTELDRETKLMPQEGSEKI I KEHRVFFSDKGPHHLCEKRLQLIEELCVKLPVRD 
PVRDTPGTCHVTLKELRAAIDSTYRKLMEDPDKWKDYTSRFSEFSSWI STN KGEAIDTANHGE 
VKRAVEEIRNGVTIOIGETLSWLKSRLKVLT^ 

DCVQYKEIVKNSLEELI SGSKEVQEQAEKI LDTENX1FEAQQL1LLHHQQKTKRISAKKRDVQQQIAQAQQG 
EGGLPDRGHEELRKLESTLDGLERSRERQERRIQVTLRKWERFETO 
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ES LSSELEQTKEFSKRTES I AVQAENLVKEASE I PLGPQNKQLLQQQAKS I ^^^5^?^??^^^? 
SS ATETYKVLQEHMDLCQALESLSSAI TAFSASARKWNRDS CVQEAAALQQQYEDI LRRAKERQTALEN 

IIK ■ - 

Human SYNE1 Protein sequence- var3 (public gi: i786i3?8) (seq id no= 297) 
mgerlakasheskaseieyklgkv^^ 

AKPIVYDSOJSEEIQRKIiNEQQEl^RDIEKHST^ 

RWRNI CAMSMERRLKI EETWI^WQKFIjDDYS RFEDWLKSSERTAAFPSS SGVI YTVAKEELKKFEAFQRQ 
VHECLTQLEI.INKQYRRIiAREmTDSA 

S I IjVWLTEMDLQLTNI EHFS ECDVQAKI KQLKAFQQE I SLNHNKI EQI IAQGEQLI EKSE PLDAAI I EEE 
LDELRRYCQEVFGRVERYHKKLIRLPIjPDDEHDLSDRELELEDSAALSDLHWHDRSJ^ 
LSIiAQPLRSERSGRDTPASVDS I PI1EWDHDYDI1SRDLE SAMSRALPSEDEEGQDDKD FYLRGAVALSDVM 
IPESPE^YVKLTENAIKNTSGDHSA^ 

LliGECSSSIDSVWRIiEHlQLiKEEEESLPGFVlTLHSTETQTAGVIDRWEIiIiQAQA^ 

NSDLNSIWAWLGDTEEELEQIjQRLELSTDIQTIELQIKKLKEIjQKAVDHRKAIIIiSINIjCSPEFTQADSK 

ESRDLQDRLSQMNGRWDRVCSLLBEWRGLL.QDA^ 
EILQDHHKQIMQIKHELLESQLRVASLQDMSCQL^^ 

leklldvsssqqdlsswssadeldtsgsvsptsgrstpnrqktprgkcslsqpgpsvssphsrstkggsd 
sslsepgpgrsgrgflfrvlraalplqij^llllig^^ 

PL 

I 

Human SYNEl Protein sequence - var4 (public gi: i786i386) (seq id no: 298) 
meldaavqkfleqngqlgkplakkigkl^ 

KV^AHGTIAWNSASQLRKQYILHQT^ 

RQMI KIRIjQNIjQDAAKDMKKFEAEIjKKLiQAALEQAQATLTS PEVGRLSIiKEQLSHRQHIjLSEMESIjKPKV 
QAVQLCQSALRI PEDVVASLPLCHAALRLQEEASRIjQHTAI QQCNI MQEAWQYEQYEQEMKHLQQIjI EG 
AHREIEDKPVATSNIQELQAQI SRHEELAQKI KGYQEQI ASLNSKCKMIjTMKAKHATMIjLTVTEVEGIiAE 
GTEDLDGELLPT PSAHPS WMMTAGRCHTLLS PVTEESGEEGTNS E I SS PPACRS PS PVANTDASVNQDI 
AYYQALS AERIjQTDAAKI HPSTSASQE FYEPGLEPSATAKLGDLQRSWETIiKNV I SEKQRTIiYEALiERQQ 
KYQDSLQSISTKMEAIELKLSESPEPGRSPESQMAEHQALMDEIimiQDE 
PAEQIiALQSTLTVLAERMSTIRMKASGK^ 
DTALSPPKEPMDMEAQIiMDCQNMLVEIEQKVV^ 

LELQRALHDKQLimQGTAQEKEESDVDLTATQSPGVQEWLAQARTTWTQQRQSSLQQQKELEQE 
LLRSVASRGEEILIQHSAAETSGDAGEKPDVLSQEIXSMEGEKSSAEDQI^KWESLHQEFSTKQKLLQNV 
LEQEQEQVLYSRPNRLLSGVPLYKGDVPTQDKSAVTS LLDGIjNQAFEEVSSQS GGAKRQS I HLEQKLYDG 
VSATSTWLDDVEERLFVATALLPEETETCLFNQEI LAKDI KE^ ED 
TLGCLLGRLSIJjDSVVNQRCHQMKERLQQII^ 

QQAEDGLKEFDAGI lELKRRGDELQVEQPSMQELSKLQDMYDEIiMMI IGSRRSGLNQNIiTIiKSQYERALQ 
DLADLIjETGQEKMAGDQKIIVSSKEEIQQPLDKHKEYFQGL^ 

MAQASAVLKRAHKRGVELEYILETWSHIjDEDQQELSRQLEVVESS I PSVGLVEENEDRLIDRITLYQHLK 
SSLNEYQPKLYQVLDDGKRI^ISISCSDLESQ^ 

LIHWLQSAKDRLEFWTQQSVTVPQELEMVRDHLNAFLEFSKEVDAQSSLKSSVLSTGN 
IiRSELSRIDSQWTDLLTNI PAVQEKLHQLQMDKLPSRHAISEVMSWTSLMENAIQKDEDNI KNS IGYKAI 
HEYLQKYKGFKIDINCKQLTVDFVNQSVLQISSQDVESKRSDKTDFAEQLGAMNKSWQILQGLVTEKIQL 

I^GLLESWSEYENNVQCLKTWFETQEKRLKQQHRIGTO 
ALIQTKKEDVSSIVMSTLRELGQTWANLDHMVGQL 
RL&TGSLEAVQVQVDN^NL^ 
IAEQLQSSKALI^LWQRYKDYSKQCAST^^ 

KDSLEVraELGEQLKQQVDASAASAIQSDQLSLSQHLC^EQALCKQQTSLQAGVLDY 
WIVEAEEILQGQDPSHSSDLSTIQERMEELKGQMLKFSSMAPDLDRLNELGYRL 

WSLI SSQTTERFS KLQS FIiLQHQTFIiEKCETWMEFLVQTEQKIiAVE I SGNYQHLLEQQRAHELFQAEMFS 
RQQ I LHS 1 1 IDGQRLLEQGQVDDRDEFNLKIiTIiIjSNQWQGVI RRAQQRRG I IDSQ I RQWQRYREMAEKLR 
K^VEVSYLPMSGLGSVPIPLQQARTLFDEVQFKEKVFLRQQGSYILTVEAGKQLLLSADSGAEAAL . 

LAEIQEKWKSASMRLEEQKKKLAFL^ 

IiENAVGSWTDDLTQLS LLKDTLSAYI SADDI S I LNERVELLQRQWEELCHQLSIjRRQQI GERIjNEWAVFS 

EKNKELCEWLTQMESKVSQNGDILIEEMIEKLKKDYQ 

YKI/jKVNDRWQHLLDIjIAARVKKLKETIjVAVQQLDKNMSSLRTO 

NEQQELQRDI EKHSTGVAS VLNLCEVLLHDCDACATDAECDS IQQATRNLDRRWRNI CAMSMERRLKI EE 
TWRIiWQKFLDDYSRFEDVOjKSSERTAAFPSSSGVIYTVAKEELKKFEAFQRQVHECLT^^ 
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ARENRTDSACSLKQMVHEGNQRWDNLQKROT 

FSECDVQAKIKQIjKAFQQEISIjNHNKIEQI IAQGEQIiIEKSEPLDAAI IEEELDEIjRRYCQEVFGRVERY 
HKKLIRLPLPDDEHDLSDRELELEDSAALSDLHWHDRSADSI^ 

SVDSIPLEWDHDYDLSRDLESi^RALPSEDEEGQDDKDFYIjRGAVALSGDHSALESQIRQIiGKALDDSR 
FQIQQTENIIRSKTPTGPELDTSYKGYMKLLGECSSSIDSVKI^^ 

VIDRWELMAQALSKELRMKQNLQKWQQFNSDLNSIWAWLGDTEEELEQLQRLELSTDIQTIELQIKKIiK 

ELQKAVDHRKMILSINLCSPEFTQADSKESR 

EMSHGIiLMLENIDRRKNEIVPIDSNLDAEI^ 

DCLEAKEKVHVIGNRIiKLLLKEVSRHIKELEKLIiDVSSSQQDIiSSWSSADEIiDTSGSVS 

KTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSLSEPGPGRSGRGFLFRVLRAALPLQLLLLLLIGLACLVP 

MSEEDYSCALSNNFARSFHPMIiRyTNGPPPIi 

Human SYNE 1 Protein sequence- var5 (public gi: 17227154) (seq id no: 299) 

MRLEEQKKKIiAFIjLKDWEKCEKX3IADSIiEKI*RTFKKK^ 

TQLSIiLKDTIiSAYISADDI S I LNERVEIiLQRQWEELCHQLSLRRQQIGERLNEWAVFS EKNKELCEWLTQ 

MESKVSQNGDILIEEMIEKIiKKDYQEEIAIAQENKIQLQQMGERIAKASHESKASEIEYI^ 

LLDL IAARVKKLKETLVAVQQLDKNMSSIiRTWLAHI ESEIjAKP I VYDSCNS EE I QRKLNEQQELQRD I EK 

HSTGVASVIiNLCEVLLHDCDACATDAECDS I QQATRNLDRRWRNI CAMSMERRLKI EETWRLWQKFIiDDY 

SRFEDWLKSSERTAAFPSSSGVTYTVAKEELKKFEAFQRQVHECIjTQLELINKQYRRLARENRTDSACSIj 

KQMVHEGNQRWDNLQKRVTS I LRRLKHFIGQREE FETARDS I LVWLTEMDLQLTNI EHFSECDVQAKI KQ 

LKAFQQE I SLNHNKI EQ 1 1 AQGEQLI EKS EPLDAAI I E EEIiDELRRYCQEVFGRVERYHKKLI RliPLPDD 

EHDIiSDRELEIiEDSAALSDLHWHDRSADSLLSPQPSSNLSLSLAQPLRSERSGRDTPASTO 

YDLSRDLESAMSRALPSEDEEGQDDKDFYLRGAVALSGDHSALESQIRQLGKALDDSRFQI QQTENI IRS 

KTPTGPELDTSYKGYMKLLGECSSSIDSVKRLEHKLKEEEESLPGFVNLHSTETQTAGVIDRWELLQAQA 

LSKELRMKQNLQKWQQFNSDLNSIWAWLGDTEEEL^ 

I LS INLCS PEFTQADS KE SRDLQDRLSQMNGRWDRVCS LLEEWRGLLQDALMQCQGFHEMSHGLLLMLEN 
IDRRKNEI VPIDSNLDAE I LQDHHKQLMQIKHELLESQLRVASLQDMSCQLIjVNAEGTDCLEAKEKVHVI 
GimLKLLLKEVSRHIKELEKLLDVSSSQQDIiSSWSSADELDTSGSVSPTSGRSTPNRQKTPRGKCSLSQP 
GPSVSSPHSRSTKGGSDSSLSEPGPGRSGRGFLFRVLRAALPLQLLLLLLI^ 
NFARSFHPMIiRYTNGPPPIi 

Human SYNE1 Protein sequence - var6 (public gi: 12698057) (seq id no: 300) 

QRKLEQHKDIiLQNTDAHKRAFHEIYRTRSW 

LSLLVLAESKLKSWIIKYGRRESVEQLLQNYVSFIENSKFFEQYEVTYQILKQTAEMYVKADGSVEEM 

VMKFMNETTAQWRNLSVEVRSVRSMLEEVISNWDRYGN^ 

WIQQHTAMNDAGNFLIETCDEIWSRDL^ 

FATEAHKKLSEPLEVS FMNVKLLI QDLED I EQRVPVMDAQYKI I TKTAHL I TKES PQEEGKEMFATMS KL 
KEQLTKVKECYSPLIjYESQQIjLI PLEELEKQMTSFYDSLGKINEI ITVLEREAQSSAIiFKQKHQELIjACQ 
ENCKKTLTIiI EKGSQSVQKFVTLSNVLKHFDQTRLQRQ IADI HVAFQSMVKKTGDWKKHVETNS RIiMKKF 
EESRAELEKVLRIAQEGLEEKGDPEELLRRHTEFFSQLDQRVLNAFLKACDELTDILPEQEQQGLQEAVR 
KLHKQWKnLQGEAPYHLLHLKIDVEKNRFIiASVEECRTELDRETKLMPQEGSEKI I KEHRVFFSDKGPHH 
LCEKRLQLIEELCVKLPVRDPVRDTPGTCWITiKEIjRAAIDSTYRKL^ 
NETQLKGI KGEAIDTANHGEVKRAVEE IRNGVTKRGETLS 
SFKALVTLLSEVEKMLSNFGDCVQYKEIVKNSI^ 

KRISAKKRDVQQQIAQAQQGEGGLPDRGHEELRKLESTIiDGLERSRERQ 

VRYLFQTGSSHERFLSFSSLESLSSELEQTKEFSKRTESIAVQAENLVKEASEIPLGPQNKQLLQQQAKS 
I KEQVKKLEDTLEEE YVIDKS 

Human SYNE 1 Protein sequence - var7 (public gi: 2895593) (seq id no: 301 

MKQNLQKWQQFNSDLNSIWAWLGDTEEELEQLQRLEI^ 

CS PEFTQADS KESRDLQDRLSQMNGRWDRVCS LLEEWRGIjLQDAIiMQCQGFHEMSHGIjLLMIjENIDRRKN 
EIVPIDSNLDAEILQDHHKQLMQIKHEI^^ 

LLKEVSRHIKELEKIiLDVSSSQQDLSSWSSi^DELDTSGSVSPTSGRSTPNRQKTPRGKCSLSQPGPSVSS 
PHSRSTKGGS DSSLS E PGPGRSGRGFLFRVLRAALPLQLLIiljIjIiI GLACLVPMSEEDYS CALSNNFARS S 
TPCSDTRMALLHSELSRCHLQKCW 

Human SYNE I Protein sequence- var8 (public gi: 6330957) (seq id no: 302 

LDLCRQSNNLCLQREEDLQRTRDYHDCMNWEV 

KYAIEDLKDQKQKMIEHIJWIiDDKELVKEQTSHIiEQRWFQLEDLIKRKIQVSVTO 

WAEEQQPNIAEALKQS PPPDMAQNLLMDHIAI CSELEAKQMLLKSLIKDADRVMADLGIoNERQVIQKAIiS 
DAQSHVNCLSDLVGQRRKYLNKALSE KTQFLMAVFQATSQ I QQHERKIMFREHI CUdPDDVS KQVKTCKS 
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AQASLKTYQNEVTGIiWAQGRELM^^ 
EDFDKACHWLKQADIVTFPEINLMNESTELHTQLAI^ 
SYLSEKLNALPRQFNVIVALAKDK^ 
ALSLQi^CRAILDEVAGLGEAVDELN^ 

AYQEHEKMCQQLERQLKSVKEEQSKVNEETLPAEEKLKMYHSLAGSLQDSGIVL 
" "LAYEKARHQI QS WQGELKLLTSAI GETVTECESRMVQS I DFQTEMSRSLDWLRRVKAEIjSGPVYIiDL*NIjQ 
DIQEEIRKIQIHQEEVQSSLRIMNALSHKEKEKFTKAra^ 
^^^EIYSQCQRTCQVFQAANDWLE^AQEI^QIiAGNGI^^ 

LDPLIKPTGKEDLEQKVASLELRSQRMSRDSGAQVDLLQRCT 
I^KTSSSHEAEEKLSEHKALVSWNSFHEKIVALEEK^ 
AQDQEKILEDAVDEWTGFNNKVKKATEMIDQM^ 
GMLRQQTLSMIjQDGAAPTPGEEPPLMQEITAMQDRCIjNMQEK^ 

VQETKEYLGNPTIEIDAQLEEMILLTEATNHRQNIEKMAEEQKEKYIiGLYTILPSE^ — 
IRDQIQDKIKEVEQSKATSQELSRQIQKLAKDL^ 

LDAAVQKFLEQNGQLGKPIoAKKIGKLTELHQQTIRQAENRIiSKLNQAASHLEEYN 
LAHGTI AWNS ASQLREQYILHQVTLGKI I FKK 

Human SYNEl Protein sequence - var9 (public gi: 20521662) (seq id no: 303 

WISLMEWIQKDEDNIKNSIGYKAIHEYl^KYKGFKIDINCKQLTVDFTO 

FAEQLGAI^KSWQILQGLVTEKIQLLEGLLESWSEYENNVQCLKTWFETQEKRLKQQHRIGDQASVQNAL 
KDCQDLEDLI KAKEKEVEKI EQNGLALI QNKKEDVSS IVMSTLRELGQTWANLDHMVGQLKI LLKSVLDQ 
WSSHKVAFDKINS YLMEARYS LSRFRLLTGSLEAVQVQVDNLQNLQDDLEKQERSLQKFGS I TNQLLKEC 
HPPVTETLTNTLKEVNMRWNNIjIiEE I AEQLrQ S S KALLQLWQRYKDYS KQCASTVQQQEDRTNELLKAATN 
KDIADDEVATWIQDCNDLLKGLGTVKDSLFVLHELGEQL^ 

KQQTSLQAGVLDYETFAKSLEALEAWIVEAEEIMGQDPSHSSDLSTIQERMEELKGQMLKFSS 
' RLNELGYRLPLNDKE I KRMQNLNRHWS L I S SQTTERFS KLQS FLLQHQTFLEKCETWME FIiVQTEQKLAV 
EISGNYQHLLEQQRAHELFQAEMFSRQQIIiHSIIIDGQRIiLEQGQVDDRDEFNIiKLTLLSNQWQGVIRRA 
QQRRGI I DS QI RQWQR YREMAE KLRKWLVEVS YLPMSGLGS VP I PLQQARTLFDEVQFKEKVFLRQQGS Y 
ILTVEAGKQLLLSADSGAEAAIjQAELAEIQEKWKSASMRIjEE 

FKKKLSQSLPDHHEELHAEQMRCKELENAVGSWTDDLTQLSLLKDTLSAYISADDI S II1NERVELI1QRQW 

EELCHQLSLRRQQIGERLNEWAVFSEKNKELCEWLTQMESKVSQNGDILIEEM 

NKIQLQQMGERIJUCASHESKASEIEYKIiGKVNDRWQHLLDLIAARVKKLK^ 

AHIESELAKPIVYDSCNSEEIQRKLNEQQELQRDI EKHSTGVASVIjNLCEVIjLHDCDACATDAECDS IQQ 

ATRNLDRRWRNICAMSMERRLKIEETWRLWQKFIiDDYSRFEDWLKSSERTAAFP 

FEAFQRQVHECIiTQLELINKQYRRLARENRTDSACSLKQMVHEGNQRWDNM 

EFETARDS ILVWLiTEMDLQLTNI EHFSECDVQAKI KQLKAFQQEISLtNHNKIEQI I AQGEQLIEKSEPLD 
AAIIEEEI^EIiRRYCQEVFGRVBRYHKKfclRLPLPDDEHDI^ 

QPSSNLSLSLAQPLRSERSGRDTPASVDSIPLEWDHDYDLSRDLESAMSRALPSEDEEGQDDKDFYLRGA 
VALSGDHSALESQIRQLGKALDDSRFQIQQTENIIRSKTPTGPELDTSYKGYMKLLGECSSSIDSVKRLE 
HKLKEEEESLPGFVNIiHSTETQTAGVIDRWELIjQAQALS KELRMKQNLQKWQQFNSDLNS IW AWLGDTEE 
ELEQLQRLELSTDIQTIELQIKKLKELQKAVDHRKAIILSINLCSPEFTQADSKESRD 
DRVCSIiLEEWRGLLQDAIiMQCQGFHEMSHGLLIiMLENIDRRKNE I VP I DSNLDAE I LQDHHKQIiMQI KHE 
LLESQl^VASLQDMSCQLLVNAEGTDCIiEAKEKVHVIGNRLKLLLKEVSRHIK^ 

WSSADELDTSGSVSPTSGRSTPNRQKTPRGKCSIiSQPGPSVSSPHSRSTKGGSDSSLSEPGPGRSGRGFL 
FRVLRAAIiPLQLLLLIiIiI GLACIiVPMSEED YSCALSNNFARS FHPMLRYTNGPP PL 

Human SYNEl Protein sequence- varlO (public gi: 28195689) (seq id no: 304 

MTEKLQYLTSVYCTEKMSQQVAEI^RETEEIjRQMIKIRLQNLQDAAKDM 
TSPEVGRLSLKEQLSHRQHLIiSEMESLKPKVQAVQLCQSAL^^ 

AIQQCMIMQEAWQYEQYEQEMKHLQQLIEGAHREIEDKPVATSNIQELQAQISRHEEIiAQKIKGYQEQI 
ASLNSKCKI^TMKAKHATIVu^^ 

EEGTNS E I SS PPACRS PS PVANTDASVNQD I AYYQALSAERLQTDAAKI HPSTSASQEFYE PGLEPSATA 

KLGDLQRSWETLKNVI SEKQRTLYEALERQQ 

LMDEILMLQDEINELQSSIAEELVSESC^ 

LEEQRQEQALQRYRCEADELDSWLI^TKATLDTALSPPKEPMDMEAQLMDCQNltt 
HNENLLLEGKAHTKDEAEQLAGKLRRLKGSIiLELQRALHDKQ 

IiAQARTTWTQQRQSSLQQQKELEQELAEQKSLLRSVASRGEEILIQHSAAETSGDAGEKPDVLSQELG 
GEKSSAEDQMRMKWESLHQEFSTKQKLIiQNVX^ 

DGLNQAFEEVSSQSGGAKRQS IHLEQKLYDGVSATSTWLDDVEERLFVATALLPEETETCIiFNQE ILAKD 
I KEMSEEMDKNKNLFSQAFPENGDNRDVIEDTLGCL^ LNFQNDLKV 
LFTSLADNKYI ILQKLANVFEQPVAEQIEAIQQAEDGLKEFDAGI IEL 

MYDEIjMMI IGSRRSGLNQNIiTLKSQYERAI^DIiADIjIjETGQEKMAGDQKI IVSSKEEIQQPLDKHKEYFQ 
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GLESHMI LTVTLFRKI I S FAVQKETQFHTELMAQASAVLKRAHKRGVELEYI LETWSHIiDEDQQEIjSRQIi 

EWESS I PSVGLVEENEDRLIDRITIiYQHLKSSIiNEYQPKLYQVLDDGKRLLISI SCSDLESQLNQLGEC 

VHjSNTNKMSKELHRIjETIXjKHWTRYQSESADLIHWIjQSAKDRIjEFWTQQ 

SKEVDAQSSLKSSVLSTGNQLl^KKVDTAT^ 

ISEVMSWTSIMENAIQKDEDNIKNSIGYKAira 

rSDKTDFAEQLGAMNKSWQILiQGIiWEKIQIjIjEGLL^^ 

SVQNALKDCQDLEDLI KAKDKEVEKI EQNGLALI QTKKEDVSS IVMSTLRELGQTWANLDHMVGQIjKILIi 
KSVI^QWSSHKVAFDKINSYLME^ 

QIiIiKECHPPVTETIiTNTLKEVNMRWNNIjLEE IAEQLQS S KALLQLWQRYKDYSKQCASTVQQQEDRTNEIj 
LKAATNKDI ADDEVATW I QDCNDLLKGLGTVKDSIjFVIjHEIjGEQLKQQVDASAAS AI QSDQIiSLSQHLCA 
LEQALCKQQTSLQAGVLDYETFAKSLEAIJ^ 
MAPDLDRLireiiGYRIiPLNDKEIK^ 

EQKLAVE I SGNYQHLIiEQQRAHELFQAEMFSRQQIIiHS 1 1 IDGQRLLEQGQVDDRDEFNLKLTIiLSNQWQ 

GVIRRAQQRRGIIDSQIRQWQRYREMAEKLRKWLVEVSYIiPMSGLGSVP I PLQQARTLFDEVQ FKEKVFti 

RQQGSYIIiTVEAGKQIjLLiSADSGAEAAIjQAELAEIQEKWKSASMRIi^ 

LEKLRTFKKKLSQSLPDHHEELHAEQMRCKELEN^ 

LLQRQWEELCHQLSLRRQQIGERLNEWAVFSEKNKE^ 

IAIAQENKIQLjQQMGERIiAKASHESKASEIEYKLGKVNDRWQHLLDL^ 

SLRTWLAHIESELAKPIVYDSCNSEEIQRKLNEQ 

CDS IQQATRNLDRRWRNI CAMSMERRLKI EETWRLWQKFIjDDYSRFEDWIjKS SERTAAF PSSSGVI YTVA 
KEELKKFEAFQRQVHECIjTQLELINKQYRRIiARENRTDSAC^ 

FIGQREEFETARDSILVWIjTEMDLQIjTNIEHFSECDVQAKI KQLKAFQQE I SLNHNKIEQI IAQGEQLI E 
KSEPLDAAI I EEELDELRRYCQEVFGRVERYHKKLIRLPLPDDEHDLSDREIiEIjEDSAALSDIiHWHDRSA 
DSDLSPQPSSNLSLSLAQPIiRSERSGRDTPASVDSIPLEWDHDYDIiSRDLESAMSRALPSEDEEGQDDKD 
FYLRGAVALSGDHSAIjESQIRQIjGKAIjDDSRFQIQQTENI irsktptgpeldts ykgymkllgecsss id 
SVKRLEHKIjKEEEESLPGFVNLHSTETQTAGVIDRWEIjLQAQAL^ 

LGDTEEELEQLQRIiELSTDIQTIELQIKKLKEIiQKAVDHRKAIILSINLCSPEFTQADSKES 
•QMNGRWDRVCS LLEEWRGLIjQDALMQCQGFHEMSHGLIiIjMLENIDRRKNE I VPI DSNIiDAE ILQDHHKQIj 
MQIKHELLESQLRVASLQDMSCQLLWAEGTDCIiEAKEKVHVIGNRLKLLLKEVSRHIK^ 
QQDLSSWSSADEI^DTSGSVSPTSGRSTPNRQKTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSIiSEPGPGR 
SGRGFLFRVLRAAIjPIjQLIiLLIiLIGIjACljVPMSEEDYSCAIjSNNFARSFHPML^ 

Human SYNE 1 Protein sequence- varll (public gi: 28195677) (seq id no: 305) 

MWAEDLSALRMAEDGCVDADLPDCNCDVTRARVK^ 
YDSCNSEEIQRKLNEQQEIjQRDIEKHSTGVASVLN^ 

CAMSMERRLKIEETWRLWQKFLDDYSRFEDWLKSSERTAAFPSSSGVIYTVAKEELKKFEAFQRQ 
TQLELINKQYRRLARENRTDSACSLKQMVHECa^QRWDNIjQKRVTS I LRRLKHFI GQREEFETARDS I LVW 
LTEMDLQLTNI EHFSECDVQAKIKQIjKAFQQEI SLNHNKIEQI I AQGEQLIEKSEPLDAAI IEEELDEIiR 
RYCQEVFGRVERYHKKLIRLPLPDDEHDLSDRELEIjEDSAALSDIiHWHDRSADSLLSPQPSSNLSLSLAQ 
PLRSERSGRDTPASVDSI PLEWDHDYDLSRDLESAMSRALPSEDEEGQDDKDFYLRGAVALSDVMI PESP 
EAYVKLTENAI KNTSGDHSALESQIRQLGKAIiDDSRFQIQQTENI IRSKTPTGPELDTS YKGYMKLLGEC 
S S S IDSVKRLEHKLKEEE E S LPGFVNLHSTETQTAGVI DRWELLQAQALS KEIJRMKQNIJQKWQQFNSDIJN 
SIWAWLGDTEEELEQLQRLELSTDIQTIELQIKKXKELQKAVD^ 

QDRLS QMNGRWDRVCS LLEE WRGLLQDALMQCQGFHEMSHGLLLMLENI DRRKNE I VP IDSNLDAE I LQD 
HHKQLMQI KHELLESQLRVASLQDMS CQIJjVNAEGTDCLEAKEKVHVI GNRLKIiLIiKEVSRHI KELEKLL 
DVSSSQQDLSSWSSADELDTSGSVSPTSGRSTPNRQKTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSLSE 
PGPGRSGRGFLFRVLRAALPLQLIjLLLLIGIiACLVPMSEED YS CALSNNFARS FHPMLRYTNGPPPL 

Human SYNE 1 Protein sequence- varl2 (public gi: 28192628) (seq id no: 306) 

MATSRGASRCPRDIANVMQRLQDEQEIVQKRTFTK^ 

SGQKLPCEQGRRMKRIHAVANIGTALKFLEGRKIKLVNINSTDIADGRPS IVLGLMWTI ILYFQIEELTS 
NliPQLQSLSSSASSVDSIVSSETPSPPSKRKVTTKIQGNAKKALLK^A^QYTAGKQTGIEVK^ 
VAFHSVIHAI RPELVDLETVKGRSNRENLEDAFTIAETEIiGI PRLLDPEDVDVDKPDE KS IMTYVAQFLK 
HYPDIHNASTDGQEDDEILPGFPSFANSVQNFKREDRVIFKEMKVWIEQFERDLTRAQWESNIX3DKYQS 
FKHFRVQYEMKRKQ I EHLIQPLHRDGKLS LDQALVKQS WDRVTSRLFDWHIQIiDKSLPAPLGTI GAWLYR 
AEVALREEITVQQVHEETANTI QRKLEQHK 

Human SYNE1 Protein sequence - varl3 (public gi: 28192522) (seq id no: 307) 

HI QLDKSLPAPLGTIGAWLYRAEVALREE I TVQQVHEETANTIQRKLEQHKRKCRTMMDLLQNTDAHKRA 

FHEIYRTRSVNGIPVPPDQLEDMAERFHFVSPTSELHIJV^^ 

RESVEQLLQNWS FI ENSKFFEQYEVTYQI LKQTAEMYVKADGSVEEAENVMKF 

S VRSMLEEVI SNWDRYGNTVASLQAWLEDAEKMLNQS ENAKKDFFRNLPHWI QQHTAMNDAGNFLI ETCD 
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EMVSIttLKQQLLLI^GRWRELFMEV^^ 

KLLIQDLEDIEQRVPVMDAQYKIITKTAHLITKESPQEEGKEMFATMSKLKEQI.TKVKECYSPLLYESQQ 
IoLIPLEEIjEKQMTSFYDSLGKINEIITVIiEREAQSSAIiFKQKHQ 
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" Unigene Name : TTC3 Unigane ID : Hs. 118174 Clone ID : GD_1105 
Human TTC3 niRNA sequence - varl (public gi: 2687860) (SBQ ID NO: 202) 

ATTAAAATAAACATCTTCTGGCCACTTCTGTTTCAACA^ 
CCTGTGTGGACGCCAACAATTCACGTGCTTCTGAGATAA 

GGAAGATATTGTGGATTTGGCAAAGAAAGTTGCTAATGATTCATTCCTTATTGGAGGCTTATTGAGAATT 

GGTTGTAA^TAGAAAATAAAATCTTGGCAATGGAAGAAGCTCTGAATTGGATAAAATATGCAGGCGA 

TAACAATTCTAACTAAATTAGGATCA&TTGACAACT 

CAAGTACCACATAACTAAAATTGTAATGGAAGACTGCAATTTGCTTGAAGAACTTAAAACTCAAAG 

ATGGATTGTATAGAGGAAGGAGAACTAATGAAAATGAAAGGAAATGi^GAGTTTTCCAAAGAAAGATTTG 

ATATAGCTATTATCTATTACACCAGAGCCATTGAATATAGACCT 

AGCTCTTTGTTTTCTTCGTACTGGACAGTTTAGAAATGCACTCGGTGATGGAAAGAGAGCCACTATTCTG 
AAGAACACTTGGCCAAAGGGTCT^TTATCGTTATTGTGATGCTCITTCTATGCTGGGGGAATATGACTGGG 
CCCTGCAAGCAAACATAAAAGCTCAAAAACTCTGTAAAAAT^ 
GCAGCATGTAAAGTTACAAAAACAAATAGAAGACCTA^^ 

GCCTTTTATGAAAACAGGGCCTACACACCTAGGAGTTTATC^GCACCTATATTTACTACTTCACTTAAC^ 

TTGTGGAGAAGGAAAGAGATTTCAGAAAAATTAATCACGAAATGGCCAACGGTGGTAATCAGAATCTAAA 

GGTGGCGGATGAGGCGTTGAAGGTAGATGATTGTGACTGTCATCCTGAAT1TTCACCACCATCAAGTCAG 

CCTCCAAAACATAAAGGAAAACAAAAATCTCGAAACAATGAATCAGAAAAGTTC^ 

yGACTTTACCAGGAGATTTGAAGAACATCTTGC^ 

GGATTTTGCTAATATAATGAAAATGCTGAGAAGCTTAATTCC^GATGGCTATATGGCCTTATTGGAGCM 
CGTTGCCGCAGCGCTGCACAGGCCTTTACAGAGTTGCTC 

TGAACCTGGCCATGATTAACTATGTTTTGGTCGTCTATGGACTTGCCATTTCTCTCCTTGGAATAGGAC^ 
GCCTGAGGAATTATCT^AAGCCGAAAACCAGTTTAAGAGGATTATTGAACACTACCCCAGTGAGGGCCTT 
GATTGCTTGGCCTACTGTGGAATTGGAAAAGTATATTTGAAAAAAAACAGATTTCTAGAAGCTCTCAATC 
ACTTTGAGAAAGCAAGAACCTTGATTTATCGTCT^ 

TATTGAAGAGTCTCAGCCAGAAAAAATAAAGATGCTGTTAGAGAAATTTGTTGAAGAATGCAAGT^ 

CCAGTGCCAGATGCCATTTGTTGCTATCAGAAGTGCCATGGATATTCTAAGATCCAGATATACATAACTG 

ATCCAGACTTTAAGGGTTTTATACGCATCAG 

CTGGAAGAAGTTAAAAACTACAACCTTTAATGATAAAATTGACAAGGATTTTCTACAAGGAATATGTCTT 

ACCCCTGACTGTGAAGGTGTCATTTCTAAGATTATCATCTTCAGCAGTGGTGGTGAAGTTAAATGTGAAT 

TTGAACACAAGGTCATAAAAGAAAAGGTTCCTCCAAGACCTATTCTGAAACAGAAATGTTCTAGCCTAGA 

GAAACTAAGACTGAAAGAAGACAAAAAATTGAAGAGAAAGATCCAAAAAAAAGAAGCAAAAAA 

G^GAAAGAATGGAGGAGGACTTAAGAGAAAGTAATCGACCCAAAAA^^ 

ACAATGTTCAGCGTTGTGkGTTCCTTGATGACAGAATTC^ 

TAAATCCGGCATACAGAATACAGCCACGCTTCTC^^ 

GACTATACAACCTGTTTTTCTAGC^GAAATTTTCTAAATGAAG 

TTCAAGAAAATAACAGAGTAAAGACAAG/^ATATTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCC 

CAAATTAGCCGCCTGGATCCAAAAACTTAATAGCT 

TATGGAGCATCTCTTAAACTGCTTGATTTTAG 

AACTAGACTCTATAGAAGGAAAGCAACTTGATTATTTC 

TTTAATATGGCTGCTAGAAGAACACAGAGACAAGTTCC 

GATATAATGGACAGCCGCTGTACTGTGTTAAGGAAAC/^ 

AGGTGAAAAACAAAAGCAAGAAA^GAAGCCAAAGGATTCAAAGCCTATC 

TTCAGTAACTTCAAATAATGAGATCATCACTTCAAGTGAAGACCATAGC^^ 

GGCCCATTTGCAGTGCCTGACCATCTTCGGCAAGATC^ 

GTAACGAATATGTTGTCCGCAATAAGAAGCTATGGGACATC^ 

TGATTACTTCTCTCAGTTTTTGGAGGAACATGGTC^ 

TATGAGTTTTTCCCAGAAGAAACTCGACAGATACTAGAAAAAGCAGGAGGTTTAAAACCTTTTOT 

GATGCCCTCGTTTTGTTGTGATTGACAACTGTATTGCACTGAAGAAGGTTGCATCACGGCTCAAGAAAAA 

AAGGAAGAAGAAAAACATTAAAACAAAAGTAGAAGAAATTTCAAAAGCAGGGGAGTATGTAC 

CTAC^CTGAATCCAGCTGCTAGGGAATTTAAACCAGATGTAAAGTCTAAACCAGTGTC^ 

C^GCACCAGCTTTTGAAAATGTGAAACCCAAACCTGTGTCTGCAAATTCTC 

TGTGAAGGCCAAACCAGTATCCGACAATTCTTCTAG 

TCTTCTT^TTCTCCTAAACCAGGCTCTGAGGATGCAAATTACAAGCGAGTCTCCTGTAATTCCCCCA^ 
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CGGTTCTTGAGGATGTGAAACCAACITATTGGGCrCA 
TCCTTTCCAGAGATTTGATATCACCCAGACACCGCGAGCATAO 

TAC^CC^GC^TATATACACCCTTGGCCAGCCTTTOTCCTGAATATCAGCTACC^GATCAGTACCAGTGG 

TGCCGTCTTTTGTAGCCAATGACAGAGC^GAT^^ 

TGCTGAGAATGTTGCTGGTCACCAGATTGCCTCTGAAACACAGAT^ 

GTAAAGTGAGACTGCAGCACAGGTGATGCTC^ 

GTGGAAATTCTAACAACAAATGTGAAGTAATTCCAGAAAGC^ 

GCAGATGGTTGCCATACAGGTATCTTGGAACATAATACACCAAGAAGTCAATACTGAGCCATATAATCCT 

TTTGAGGAACGAC^GGGGAAATTTC^CXSGATTGAAAAGGAGC^CC^GTATTACAAGACCAACTTCAAG 

AAGTGTATGAAAATTATGAGCAGATAAAACTTAAGGGCTTAGAAGAGACCAGGGACCTGGAAGAGAAGTT 

GAAAAGGCACTTAGAAGAAAACAAGATCTCAAAGACGGAATTAGATTGGTTCCTTCA 

GAAATTAAAAAATGGCAACAGGAAAAAAAAGAAATCCAAGAAAGACTA 

AAAAGGTTTCAAATGCCAGTGAAATGTATACCCAGAAAAATGATGGAAAGGAAAAGGAACATGAATTACA 
TCTGGATCAGTCCCTTGAAATCAGCAAC^CACTTACAAATGAGAAAATGAAAATAGAAGAGTATATAAAG 
AAAGGGAAAGAGGATTATGAAGAGAGTCATCAGAGAGCTGTGGCT 

GGAAGGAGAGTGAAGTGTATAAGCTACAGATCATGGAGTCACAAGCAGAAGCCTTTCTGAAGAAGCTGGG 

GCTGATTAGCCGTGATCCTGCAGCATATCCTGACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTT 

TCTAATGTTACAAAAGTAATTGAGAAAGC^AAGTCTCAGTTTGAAGAACAAATTAAGGCAAT^ 

GTTCTCGGCTCAGTGAACTTTCTAT^GTGCAGATTTCTGAGCTTTCATTTCCTGCCTGTAACACGGTTCA 

TCCCGAGTTACTCCCTGAGTCTTCAGGCGACGATGGCCAAGGGCTTGTGACTTCTGCAAGCGACGTGACT 

GGAAACCACGCAGCACTTCACAGGGATCCTAGTGTGTTCTCTGCTGGTGATTCCCCAGGGGAGGCTCCTT 

CTGCGCTGTTGCCAGGGCCACCCCCTGGTCAGCCTGAAGCCACTCAGOTGACAGGGCCAAAACGGGCTGG 

CCAGGCAGCTCTGTCAGAACGAAGCCCTGTGACTGATCGGAAGCAGCCTGTTCCTCCAGGACGTGCT 

CGTTCAAGCCAGTCTCCAAAAAAGCCGTTCAATAGTATTATTGAGCACCTGTCAGTGGTATTCCCATGTT 

ACAACAGCACTGAGCTTGCTGGTTTTATTAAAAAAGTGCGAAGCAAAAAC^GAACTCACTCTCAGGATT 

GAGTATTGATGAAATTGTCGAAAGAGTGACAGAACACATTCT^ 

CCAGGAAAGGACAAGAGGACTTATGAGCCCAGCTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCAC 
CCTCGGTGGTTGTTGCACCATCACCCAAAACCAAGGGGCAGAAAGCAGAAGATGTCCOT 
ACTGGGTGCAAGTTCCTGTGAAATATGCCACGAGGTGTTCAAATCAT^AAAACGTGCGTGTGCTCAAATGT 
GGGCAGAAXSTATCACAAAGGGTGCTTTAAGCAGT^ 

GTCGTGATCTCCTGACAGAAGAGTCACCTTCTGGAAGAGGCTGGCCCAGTCAGAATCAGGAGOTGCCTTC 

CTGCTCTTCTAGGTAGTCACACTTCACTAAAGTGTCATCCACCAGTGTGTTGAATCCGA^ 

TTTCTACCACTGGTGTAAAAAACAAACATTTGAAGACCCTTGTGCATTGTGTGTCA 

TGGAAATCGTTAATATCGCTGATATTAAGTAAT^ 

AGTGGCTAATAAAATGACGGCTACCACACTCATGGGTC^ 

GGCTTCTTTTGGAAAGTACTATGAACGTCTCGAAGCAGTATTCTAGTGATAAGAATTCTTAACATAGCCA 

AGCGCCCCACGTTTGTTCCCCAaSTTTGTTCCCCTTTTCTGTTTGAAAAACCTGTTCTGGTAGCT 

AGAGAGATGATACTGACTTTTTAAATTTTTTACAAGAGTCTGTATTCCTGATATGCCTATATTTTTCCTC 

AAAGATTCTGCATTTTAAGGATGGGCATAAGCAAACTATATTTTAATAATTTATAGTTAATGTTAAAATA 

TTGGCTGATTTAGACCAAAAGATTCAAATCTCCTCTT 

TTTTATGTTCCCCCGTTAGATTGTTTTAAGTGTTTGCTTTTCATCTTTTATAGATGTAATCTGATTTTCA 
AAAATCATTAACACTTTTTAATTAGTATCGACTAAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTCCG 
TCATCCCAGCCCCCGGTTGGAGCCTGCTCTTTGAACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCTTC 
TGTGAGTCAGTCAGCGAGTGCT1XK3GATCCG(^TCCAGC^^ 

AATGGCCACCACCATTCTCCTTCCCCACCCCACGACAAAAAGAGAAGCTGTGTCTTTAGAC^ 
GTATCTGTGTTACAATCGTTCTGTGTTTGATATTTGTGTAAAGTATGC^TGCAGTCTTGTACTGTGACCT 
AAGAACAAAACTGTAACTGCATTAGAAACCATGAAAAAATT 
GTAAATATAGAACCATGAATTCTGGTCACATTC^^ 

CAATGTGTTCTTTGTTCCACTGGAAACTTAGAGTCATGAGTTTATGAGCTGATTTGGTCACCTTCCTCT^ * 

CCTTTGTTCACTGTGAGTTCTGATGTC^ 

GATTCTCC^CGGTGGTCCCCAAAACACTGTCTGC^^ 

GCATGAGGATCAGTTTGCAGCAGC7UVGTACAAAAGGAGAAGAGGAACATCCGTTGAAT 
TACATAACTTCAGATACTTGTGAACATGCCTTATATTTGTCCAACAACTGTCAGAATAAAGAACA 

Human TTC3 mRNA sequence - var2 (public gi: 1632765) (seq id no: 203) 

TACATTTGGAAAGTCTTACTGACATGCAGAAATAGTACAGA 
GGGCATGGTGGCTCACACCTOTAATCCC^G^^ 

GATCGAGACCATCGTGGCTAACATGGTGAAACCCTGTGCCTACTAAAAATTCAAAAAATTAGCCAGGTGT 
ACTGGCATGTGCTGTAATCCCAGCTACTTGGGAGGCTC 
GGTTGCAGTGAGCCAAGATCACXjCCACTGC7^CTCCAGCCTGGGCX3ACAG^ 
AAACAAAAGAATGCTATGCATAGGTACAATGTCGAAATGTGG^ 

GTTATTCCTGGGTAGTTGTGATTGTGACTCTGGGTGACTTTTTCCCTTTGTTTTATTTTTCT 
CAAATTTCCTATAATGGACATATATTATATGGATTTTTTTTAATTAAAATTATCTTTT 
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TACATGGAAAATTCACAAAGTACACATATGCAAGATA 

AAATACTTGTTTTTCCTTTTAGAAATGCACTCGGTGATGGAAAGAGAGCCACTATTCTGAAGAA 

GCCAAAGGGTCATTATCGTTATTGTGATGCTCTTTCTATGCTGGGGGAATATGACTGG 

AACATAAAAGOTCAAAAACTCTGTAAAAATGACCCTGAGGGAATCAAGGATCTAATT^ 

AGTTACAAAAACAAATAGAAGACCTACAAGGTCGAACAGCAAATAAGGATCCAATTAA 

AAACAGGGCCTACACACCTAGGAGTTTATGAGCACCTATATTTACTACTTCACTTAACT 

GAAAGAGATTTCAGAAAAATTAATCACGAAATGGCCAACGGTGGTAATCAGAATCTAAAGGTGGCGGATG 

AGGCGTTGAAGGTAGATGATTGTGACTGTCATCCTGAATTTT 

TAAAGGAAAACAAAAATCT CGAAACAATGAATCAGAAAAGTTCAGTTC TAGTTCACCATTGACTTTACCA 
GCAGATTTGAAGAACATCTTGGAGAAACAGTTTTCTAAATCT^ 

ATATAATGAAAATGCTGAGAAGCTTAATT(^GATGGCTATATGGCCTTATTGGAGC^GCGTTGCCGCM 

CGCTGCACAGGCCTTTACAGAGTTGCTGAACGGTTTAGATCCTCAAAAAATAAAGCAATT 

ATGATTAACTATGTTTTGGTCGTCTATGGACTTGCCATTTCTCTCCTTGGAATAGGACAGCCTGAGGAAT 

TATCTGAAGCCGAAAACC^GTTTAAGAGGATTATTGAACACTACCCCAGTGAGGGCCTTGATTGCTTGGC 

CTACTGTGGAATTGGAAAAGTGTATTTGAAAAAAAACAGATTTCTAGAAGCTCTCAATCACTTTGAGAAA 

GCAAGAACCTTGATTTATCGTCTTCCTGGAGTGTTAACTTGGCCCACGAGTAATGTGATTATTGAAGAGT 

CTCAGCCACAAAAAATAAAGATGCTGTTAGAGAAATTTGTTGAAGAATGCAAGTTCCCTCCAGT 

TGCCATTTGTTGCTATCAGAAGTGCCATGGATATTCTAAGATCCAGATATACATAACTGATCCAGACTTO 

AAGGGTTTTATACGCATCAGCTGTTGCCAGTACTGTAAAATAGAATTTCACATGAATTGCTGGAAGAAGT 

TAAAAACTACAACCTTTAATGATAAAATTGACAAGGATTTTCTACAAGGAATATGTCTTACCCCTGACTG 

TGAAGGTGTCATTTCTAAGATTATGATCTTCAGCAGTG^ 

GTCATAAAAGAAAAGGTTCCTCCAAGACCTATTCTGAAACAGAAATGTTCTAGCCTAGAGAAACTAAGAC 
TGAAAGAAGACAAAAAATTGAAGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAAT 
GGAGGAGGACTTAAGAGAAAGTAATCCACCCAAAAATGAAGAGCAGAAAGAAACTGTAGACAATGTTCAG 
CGTTGTCAGTTCCTTGATGACAGAATTCTACAGTGTATAAAGCAGTATGCTGACAAGATTAAATCCGGCA 
TACAGAATACAGCCATGCTTCTCAAAGAATTGCTTTCTTGGAAAGTTTTGAGCACAGAAGACTATACAAC 
CTGTTTTTCTAGCAGAAATTTTCTAAATGAAGCAGTGGACTATGTTATTCGCCACTTGATTCAAGAAAAT 
AACAGAGTAAAGACAAGAATATTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCCCAAATTAGCCG 
CCTGGATCCAAAAACTTAATAGCTTTGGCTTAGATGCC^ 

TCTTAAACTGCTTGATTTTAGTATCATGACTTTCCTCTGGAATGAGAAATATGGTCACAAACTAGACTCT 

ATAGAAGGAAAGCAACTTGATTATTTCTCTGAGCCAGCATCATTGAAGGAAGCCCGTTG 

TGCTAGAAGAACAC7VGAGACAAGTTCCCAGCATTGCATAGTGCTTTAGATGAATTCTTTGATATAATGGA 

CAGCCGCTGTACTGTGTTAAGGAAACAAGATAGTGGTGAAGCACCGTTTAGTTCAACCAAGGTGAAAAAC 

AAAAGCAAGAAAAAGAAGCCAAAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAACTTCAGTAACTO 

GAAATAATGAGATCATCACTTCAAGTGAAGACCATAGCAATCGAAATTGAGATTOT 

AGTGCCTGACCATCTTCGGCAAGATGTAGAAGAATTCGAAGCTCTCTATGACGAACACAGTAACGAATAT 

GTTGTCCGCAATAAGAAGCTATGGGACATGAACCCAAAACAAA 

CTCAGTTTTTGGAGGAACATGGTCCCTTGGACATGAGT^ 

CCCAGAAGAAACTCGACAGATACTAGAAAAAGCAGGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGT 

TTTGTTGTGATTGACAACTGTATTGCACTGAAGAAGGTTGCATCACGGCTC 

AAAACATTAAAACAAAAGTAGAAGAAATTTGAAAAGCAGG 

TCCAGCTGCTAGGGAATTTAAACCAGATGTAAAGTCTAAACCAGTGTCAGATTCATCTTCAGCACCAGCT 

TTTGAAAATGTGAAACCCAAACCTGTGTCTGCAAATTCTCCCAAGCCAGCTTGTGAAGATGT 

AAC CAGTATCCGACAATTCTTCTAGACAAGTTT CTGAGG ATGGGCAACC CAAAGGGGTCTCTTCTAATTC 

TCCTAAACCAGGCTCTGAGGATGCAAATTACAAGCGAGTCTCCTGTAATTCCCCCAAACCGGTTCTTGAG 

GATGTGAAACCAACTTATTGGGCTCAATCCCATTTGGT 

GATTTGATATCACCCAGACACCGCCAGGATACATAAACGTGTTACC^ 

ATATACACCCTTGGCCAGCCTTTCTCCTGAATATCAGCTACCAAGATCAGTACCAGTGGTGCCGTCTTTT 
GTAGCCAATGAC^GAGCAGATAAAAATGCTGCTGCCTATTTTGAGGGTCATCATTTGAATGCTGAGAATG 
TTGCTGGTCACCAGATTGCCTCTGAAACACAGATCCTTC 

CTGCAGCACAGGTGATGCTCATAGAGTCCTGAGTGAGTCTAACAGAAATGATGAGC^CTGTGGAAATTCT 
AACAACAAATGTGAAGTAATTCCAGAAAGCACCAGTGCAGTAACAAACATTCC^ 
CCATAGAGGTATCTTGGAACATAATAGACCAAGAAGTCAATACTGAGCCAT^ 
ACAAGGGGAAATTTCACGGATTGAAAAGGAGCACCAAGTATTACAAGACCAA 

AATTATGAGCAGATAAAACTTAAGGGCTTAGAAGAGACCAGGGACCTGGAAGAGAAGTTGAAAAGGCACT 
TAGAAGAAAACAAGATCTCAAAGACGGAATTAGATTGGTTCCTTCAAGATTTGGAAAGAGAAATT^^ 

' ATGGCAACAGGAAAAAAAAGAAATCCAAGAAAGACTAAAATCACTGAAG 
AATGCCAGTGAAATGTATACCCAGAAAAATGATGGAAAGGAAAAGGAACATGAATTACATCTGGATC^ 

CCCTTGAAATCAGCAACACACTTACAAATGAGAAAATGAAAATAGAAGAGTATATAAA 

GGATTATGAAGAGAGTCATCAGAGAGCTGTGGCTGCAGAGGTATCCGTACTTGAAAACTGGAAGGAGAGT 

GAAGTGTATAAGCTACAGATCATGGAGTCACAAGCAGAAGCCTTTCTGAAGAAGCTGGGGOT 

GTGATCCTGCAGCATATCCTGACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTTTCTAATGTTAC 

AAAAGAAATTGAGAAAGCAAAGTCTCAGTTTGAAGAACAAATTAAGGCAATTAAAAATGG 
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AGTGAACTTTCTAAAGTGCAGATTTCTGAGCTTTC^ 

TCCCTGAGTCTTCAGGCCACGATGGCCAAGGGCTTGTGACTTCTGCAAGCGACGTGACTGGAAACCACGC 
AGCAC^C^CAGGGATCCTAGTGTGTTCTCTGCTGGTGATTCCC^GGGGAGGCTCCTTCTGCGCTGTTG 
CCAGGGCCACCCCCTGGTCAGCCTGAAGC 

TGTCAGAACGAAGCCCTGTGGCTGATCGGAAGCAGCCTGTTCCTCCAGGACGTGCTGCGCGTTCA^ 
" GTCTCGAAAAAAGC03TTCAATAGTATTATTGAGCACCTGTCAGTGGTATTCCCATGTTACAA 

GAGCITGCTGGTTTTATTAAAAAAGTGCGJ^^ 
^AAATTGTCCAAAGAGTGAC^GAACACATTCTAGATGAACAGAA 

CAAGAGGACTTATGAGCCCAGCTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCACCCTCGGTGGTT 

GTTG(^CCATCACCCAAAACCAAGGGGCAGAAAGCAGAA 

GTTCCTGTGAAATATGCCACGAGGTGTTCAAATCAAAAAACGTGCGTGTGCTCAAA 

TCACAAAGGGTGCTTTAAGCAGTGGCTTAAAGGGCAGAGCGCTTGCCCGGCCTGCCAGGGTCGTGATCTC 
CTGACAGAAGAGTCACCTTCTGGAAGAGGCTGGCCCAGTCAGAATC^GGAGCTGCCTTCCTGCTCTTCTA 
GGTAGTCACACTTGACTAAAGTGTCATCC^ 

GGTGTAAAAAACAAACATTTGAAGACCCTTGTGCATTGTGTGTCACAAAGCTAAATACATG 
AATATCGCTGATATTAAGTAATTTCCCCACTCTGAGTGAA 

AAATGACGGCTACCACACTCATGGGTC^CTGGGGCTGCGCAGGGCTCTTTGAGGTGGGTGGCTTCTTTTG 
GAAAGTACTATGAACGTCTCGAAGCAGTATTCTAGTGATAAGAATTCTTAACATAGCCAAGCGCCCCACG 
TTTGTTCCCCACGTTTGTTCCCCTTTTCTGTTTGAAAAACCTGTTCTGGTAGCTCCACAAGAGAGATGAT 
ACTGACTTTTTAAATTTTTTACAAGAGTCTGTAT^ 

ATTTTAAGGATGGGCATAAGCAAACTATATTTTAATAATTTATAGTTAATGTTAAAATATTGGCTGATTT 
AGACCAAAAGATTCAAATCTCCTCTTTGTGAAATCCCATCT 

CCCGTTAGATTGTTTTAAGTGTTTGCTTTTCATCTTTTATAGATGTAATCTGATTTTCAAAAATCA 

CACTTTTTAATTAGTATCGACTAAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTCCGTCATCCCAGCC 

CCCGGTTGGAGCCTGCTCTTTGAACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCTTCTGTGAGTCAGT 

CAGCGAGTGCTTGGGATCCGCATCCAGCCGTGCTGAGCACACAACAGGCTGTGTGTGGAAATGGCCACCA 

CCATTCTCCTTCCCCACCCCACCACAAAAAGAGAAGCT^ 

ACAATCGTTCTGTGTTTGATATTTGTGTAAAGTATGCATGCAGTCTTGTACTGTGACCTAAGAACAAAAC 

TGTAACTGCATTAGAAACCATGAAAAAATTAGATATTGTTTTGTGACTTTTAGACA 

ACCATGAATTCTGGTCACATTCCATTTCTCTCCAACATGAAGGATCAA 

TTGTTCCACTGGAAACTTAGAGTCATGAGTTTATGAGCTGATTTGGTGACCTTCCTCTGCCTTTG 
TGTGAGTTCTGATGTCTTAGTGACTTAGTTCTTAGAAGCT 

GTGGTCCCCAAAACACTGTCTGCATATCCATAAGAATTGAGCGCTATGGGTGTTAACGTGCATGAGGATC 

AGTTTGCAGCAGCAAGTACAAAAGGAGAAGAGGAACATCCGTTGAATGAGTGTGTTT 

AGATACTTGTGAACATGCCTTATATTTGTCCAACAACTGTCAGAATAAAGAACATTCTAAAA 

Human TTC3 mRNA sequence - var3 (public gi: 1632763) (seq id no: 204 

CTGAACTAGTTGCC^GTGATCTTGAAACGTGACAGTAACCAAGAGATAAATAGGTGACAATGAC^GGA^ 
ATTAGATGTAGTAAAAGAGAGTGTTTGAGAGCAGAAGCTATGGCAACTAAAGACTGGATTTGAATCCTTC 
CTAGCTTGGTGACATGAGCAAATTACTTGATTTAAGTG 

ATAATTGTGCCTGCTAAGAAGAATTGCTGTGAAGATTAGTGAAATAATGC^TGTAAAACATTTGGTAGA^ 
TATGTGACACATAGTACAAATAGTTTGCTAGGAAGATTGTTATTATTCTTCACTTGTGATATTGTGAAGT 
TTTCATAC^GCAAATTGGACATCATGAGATGGATTGATTAAATAAATAGATTTGAACTTCAAGGACTGGT 
AGTGTTCTTGCTTTGGAAAGAAGAAACTTGGTTTATCCTAATAATAGTAGGATAATAATGGTGAAGTGAT 
AGGTACAAGTAATAGTGTTTATGATGCGCTGGTGATGATAGGAAAAGAAAGCCATTATATGGGCAAGAGC 
TAGAAGTAATAAAATGGTG(^TTTTTCAGTGATGTTTGGCCTATGTAGCTATTCTCTGATAACTATAAAA 
ATCCTTATTATTGAAGATTCTTCAGGAAAAAAAAACCCTTA 

CCCCCCATTGAAATACGTATTTTTAAAACATGGCTTTTGATAATGTGAGGGTTTTTTCCTTTTTG 

TAGCAGTGCTGATTGTGTATTGCAGTAGTTGTGAGAGCATTAGAAGCAGCAGTCGATAGGAGGATGGAAG 

GTCTGGATGCCGCCTTGGGGAGTTAGGAGATTGGGAGACTTACCCT^ 

GCCCAAGACAGAAACACACTGAGATGGATAGGAGAATATGAGCAGTTGATAGGAAAG 

CAGGATTTAGGTTAGGCCAGGAGATTGAGAATATAA(^VGTTTGTGTATGATGAAATGGCATATTTCAC^G 

AATGCAGTAAAAGCAGGTAGGGTACAAGTGCAGCAACAGGAAGATC 

TATTTGAGAGCTTACCATGTGCTAGGGACATACAAAGAT^ 

AAGGAGACATGTAAACAGGTTAACTTAGAGTAGAGATGGTGAATATC 

AGATTAAATTATCTGGAGAGAGAGGAAAAGTCAGCAGAATGGGGACGAGAATCTT^ 

CTGATAGGAGTTATTTCCTTGGGCATAGGTTCCAAGTATTTTTCTAATATACCATAG 

TTTCTTCTGTTATCTCAAATGATTTAATTACTGACTTGAGTTTGTGTTGTOT 

TGGAC^TTTTGCTGAGGGAGATTTCACTGTGGCGGATTATGCCI^GTTAGAAGATTGCCCTCACGTGGA 
TGATTGTGTCTTTGCTGCTGAATTTATGAGCAATGATTATGTTCGTGTGACTCAGOT 
GTGGGTGTGCAATATAAAGATTATATCCAAAGTGAGAGGAATTTGGAATTTGAGATCTGCAGTATATGGT 
GTAGTAAACCAATTTCTGTCCTG CAAG ATTATTGCGATGCCATTAAAATAAACATCT TCTGGCCACTTCT 
GTTTCAACATCAAAACAGTTCCGTAATATCACGATTGCATCCCTGTGTGGACGCCAACAATTCACGTGCT 
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^GCT^TGATTCA 

AATGGAAGAAGCTCTGAATTGGATAAAATATGCAGGCGATGTAAC^ 

GACAATTGTTGGCCTATGTTAAGTATTTTCTTTACTGAATGAAGGAGAACTAATGAAAATGAAAGGAAAT 

GAAGAGTTTTCCAAAGAAAGATTTGATATAGCTATTATCTATTACAC 

AAAACTACCTTCTTTATGGTAACCGAGCTCTTTGTTTT 

TGATGGAAAGAGAGCCACTATTCTGAAGAACACTTGGCCAAAGGGTCATT 

TCTATGCTGGGGGAATATGACTGGGCCCTGCAAGCAAACATAAAAGCTCAAAAAC 

CTGAGGGAATCAAGGATCTAATTCAGCAGCATGTAAAGTTACAAAAACAAATAGAAG 

AACAGCAAATAAGGATCCAATTAAAGCCTTTT ATGAAAACAG GGC CTACACACCTAGGAGTTT ATCAGCA 

CCTATATTTACTACTTCACTTAACTCT 

CCAACGGTGGTAATCAGAATCTAAAGGTGGCGGATGAGGCGTTGAAGGTAGATGATTGTGACTGTCATCC 

TGAATTTTCACCACCATGAAGTCAGCCTCCAAAAGATAAAG 
GAAAAGTTCAGTTCTAGTTGACGATTGACTTTACCM 

CTAAATCTTCCAGAGCTGCACACCAGGATTTTGCTAATATAATGAAAATGCTGAGAAGCTTAATTCAAGA 
TGGCTATATGGCCTTATTGGAGCAGCGTTGCCXSCAGCGCT^ 

TTAGATCCTCAAAAAATAAAGCAATTGAACCTGGCC^TGATTAACTATGTTTTGGTCGTCTATGGACTT^ 
CCATTTCTCTCCTTGGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACCAGTTTAAGAGGATTAT 
TGAACACTACCCCAGTGAGGGCCTTGATTGCTTGGCCTACTGTGGAATTGGAAAAGTGTATTTGAAAAAA 
AACAGATTTCTAGAAGCTCTCAATCACTTTGAGAAAGCAAGAACCTTGATTTATCGTCTTCCTGGAGTGT 
TAACTTGGCCCACGAGTAATGTGATTATTGAAGAGTCTCAGCCACAAAAAATAAAGATGCTGTTAGAGAA 
ATTTGTTGAAGAATGCAAGTTCCCTCCAGTGCCAGATGCCATTTGTTGCTATCAGAAGTGCCATGGATAT 
TCTAAGATCCAGATATACATAACTGATCCAGACTTTAAGGGTTTTATACGCATCAGCTGTTGCCAGTACT 
GTAAAATAGAATTTCACATGAATTGCTGGAAGAAGTTAAAAACTACAACCTTTAATGATAAAATTGACAA 

GGATTTTCTACAAGGAATATGTCTTACCC^ 

AGTGGTGGTGAAGTTAAATGTGAATTTGAACACAAGGTGATAAAAGAAAAGGTTCCTCCAAGACCTATTC 

TGAAACAGAAATGTTCTAGCCTAGAGAAACTAAGACTGAAAGAAGACAAAAAATTGAAGAGAAAGATCCA 

AAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAATGGAGGAGGACTTAAGAGAAAGTAAT 

AATGAAGAGCAGAAAGAAACTGTAGAGAATGTTCAGCGTTGTC^GTTCCTTGATGACAGAATTCTACAGT 

GTATAAAGCAGTATGCTGACAAGATTAAATCCGGCATACAGAATACAGCCATG 

TTCTTGGAAAGTTTTGAGCACAGAAGACTATACAACCT^ 

GTGGACTATGTTATTCGCCACTTGATTCAAGAAAAT^ 

TGAGTGAGCTTAAAGAAGTGGAGCCCAAATTAGCCGCCTGGATCCAAAAACTTAATAGCTTTGGCTTAGA 
TGCCACAGGAACTTTCTTTTCTCGTTATGGAGCATCTCTTAAACTGCTTGATTTTAGTATCATGACTTTC 

CTCTGGAATGAGAAATATGGTCACAAACTAGACTCTATAGAAGGAAAGCAACTTGAT^ 
CAGCZATCATTGAAGGAAGCCCGTTGTTTAATATGGCTGCTAGAAGAACACAGAGACAAGTTCCCAGCATT 
GCATAGTGCTOTAGATGAATTCTTTGATATAATGGACAGCCGCTGTACTGTGTTAAGGAAACAAGATAGT 
GGTGAAGC^CCGTTTAGTTCAACCAAGGTGAA;JUVCA7^AAGCAAGAAAAAGAAGCCAAAGGATTC7^AAGC 

CTATGTTAGTTGGGT CTGGAACAACTT CAGTAACTTC AAATAATGAGAT CATCACTTCAAGTG AAGACCA 
TAGCAATCGAAATTCAGATTCTGCAGGCCCATTTGCAGTGCCTGACCATCTTCGGCAAGATGTAGAAGAA 

TTCGAAGCTCTCTATGACCAAGACAGTAACGAATATGTTGT^ 

CAAAACAAAAATGTTCAACTCTATATGATTACTTCTCTCAGTTTTTGGAGGAACATGGTCCCTTGGACAT 

GAGTAACAAGATGTTCTCTGCAGAATATGAGTTTTTCCCAGAAGAAACTCGACAGATACTAGAAAAAGCA 

GGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGTTTTGTTGTGATTGACAACTGTATTGCACTG 

AGGTTGCATCACGGCTCAAGAAAAAAAGGAAGAAGAAAAACATTAAAACAAAAGTAGAAGAAATTTCAAA 

AGCAGGGGAGTATGTACGAGTTAAACTACAACTGAATCCAGCTGCTAGGGAATTTAAACCAGATGTAAAG 

TCTAAACCAGTGTCAGATTCATCTTCAGCACCAGCTTTTGAAAATGTGAAACC 

ATTCTCCCAAGCCAGCTTGTGAAGATGTGAAGGCCAAACCAGTATCCGAGAATT 

TGAGGATGGGCAACCCAAAGGGGTCTCTTCT71ATTCTCCTAAACCAGGCTCTGAGGATGCAAATTAGAAG 
CGAGTCTCCTGTAATTCCCCCAAACCGGTTCTTGAGGATGTGAAACCAACTTATTGGGCTCAATCCCATT 
TGGTCACAGGATACTGTACGTATCTTCCTTTCCAGAGATTTG 

AAACGTGTTACCAGGTTTGCCCCAGTACACCAGCATATATACACCCTTGGCCAGCCTTTCTCCT^ 
CAGCTACCAAGATCAGTACCAGTGGTGCCGTCTTTTGTAGCCAATGA 

CCTATTTTGAGGGTCATCATTTGAATGCTGAGAATGTTGCTGGTC^CCAGATTGCCTCTGAAACACAGAT 
CCTTGAGGGCTCTTTGGGAATATCTGTAAAGTCACA^ 

GAGTCTAACAGAAATGATGAGCACTGTGGAAATTCTAAC^CAAATGTGAAGTAATTCCAGAAAGCACCA 
GTGCAGTAACAAACATTCCACACGTGCAGATGGTTGC 

AGTCAATACTGAGCCATATAATCCTTTTGAGGAACGACAAGGGGAAATTTCACGGATTGAAAAGGAGCA 
CAAGTATTACAAGACGAACTTCAAGAAGTGT^ 

AGACCAGGGACCTGGAAGAGAAGTTGAAAAGGCACTTAGAAGAAAACAAGATCTCAAAGACGGAATTAGA 

TTGGTTCCTTCAAGATTTGGAAAGAGAAATTAAAAAATGGCAACAGGAAAAAAAAGAAATCCAAG 

CTAAAATCACTGAAGAAGAAAATTAAAAAGGTTTCAAATGCCAGTG^^ 

GAAAGGAAAAGGAACATGAATTACATCrGGATCAGTCCCTTGAAATCAGCAACACACTTACAAATGAGAA 
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AATGAAAATAGAAGAGTATATAAA 

GCAGATOTATCCGTACTTCSAAAACTGGAAGGAGMTGA^ 

CAGAAGCCTTTCTGAAGAAGCTGGGGCTGATTAGCCGTGATCCTGCAGCATATCCTGACATGGAGTCTGA 
TATACGTTCATGGGAATTGTTTCTTTCTAATGTT^ 

GAACAAATTAAGGCAATTAAAAATGGTTCTCGGCTCAGTGAACTTTCTAAAG 
CATTTCCTOCCTGTAACACGGTTCATCCCGAGTTACTCCCTGAGTC 

TGTGACTTCTGCAAGCGACGTGACTGGAAACCACGCAGCACTTCACAGGGATCCTAGTGTGTTCTCTGCT 

GGTGATTCCCCAGGGGAGGCTCCTTCTGCGCTGTTGCCAGGGCCACCCCCTGGTCAGCCTGAAGCCACT 

AGCTGACAGGGCCA7U^CGGGCTGGCCAGGCAGCTCTGTCAGAACGAAGCCCTGTGGCTGATCGGAAGCA 

GCCTGTTCCTCCAGGACGTGCTGC^ 

C^CCTGTCAGTGGTATTCCCATGTTACAACAGCACrGAGCTTGCTG^ 
AAAACAAGAACTCACTCTCAGGATTGAGTATTGAT^^ 

TGAACAGAAAAAGAAAAAGCCAAACCCAGGAAAGGACAAGAGGACTTATGAGCCCAGCT 
GTGACCAGGTCCTCCCAGGGCTCACCCTCGGTGGTTGTTC 

CAGAAGATGTCCCTGTGAGGATTGCACTGGGTGCAAGTTCCTGTGAAATATGCCACGAGGTGTTCAAATC 
AAAAAACGTGCGTGTGCTCT^TGTGGGCACAAGTATCACAAAGGGTGCTTTAAGCAGTGGCTTAAAGGG 
CAGAGCGCTTGCCCGGCCTGCCAGGGTCGTGATCTCCTGACAGAAGAGTCACCTTCTGGAAGAGGCTGGC 

CCAGTCAGAATCAGGAGCTGCCTTCCTGCTCTT 

TGTGTTGAATCCGAAGAATGACAATTTTCTACCACTGGTGTAAAAAACAAACATTTGAAGACCCTTG 
ATTGTGTGTCACAAAGCTAAATACATGGAAATCGTTAATATCGCTGATATTAAGTAATTTCCCCACTCTG 
AGTGAATACTTTGATGATTGCCAACAGTGGCTAATAAAATGACGGCTACCACACTCATGGGTCACTGGGG 
CTGCGCAGGGCTCTTTGAGGTGGGTGGCTTCTTTTGGAAAGTACTATGA 

AAAAACCTGTTCTGGTAGCTCCACAAGAGAGATGATACTGACTTTTTAAATTTTTTACAAGAGTCTGTAT 

TCCTGATATGCCTATATTTTTCCTO^AAGATTCTGCATTTTAAGGATGGGCATAAGC^ 
ATAATTTATAGTTAATGTTAAAATATTGGCTGATTTAGACCAAAAGATTCAAATCTCCTCTTTGTGAAAT 

CCCMCTGCATTTGATT^^ 
TTTTATAGATGTAATCTGATTTTC^AAAATC^T^ 

CCCTGGAATCGAGGCTGTGTGTCCGTCATCCCAGCCCCCGGTTGGAGCCTGCTCTTTGAACTCCGCTGCC 



GAGCACACAACAGGCTGTGTGTGGAAATGGCCACG^^ 

AGCTGTGTCTTTAGACAACCCTGAGGTATCTGTGTTACAATCGTTCTGTGTTTGATATTTGTGTAAAGTA 
TGCATGCAGTCTTGTACTGTGACCTAAGAACAAAACTGTAACTGCATTAGAAACCATGAAAAAATTAGAT 
ATTGTTTTGTGACTTTTAGAC^GTGGTAAATATAGAACC^TGAATTCTGGTCAaVTTCCATTTCTCTC^ 
ACATGAAGGATCA/^AAAATGTTTTTCAATGTGTTCTTTGTTCCACTGGAAACTTAGAGTCATGAGTTTAT 

GAGCTGATTTGGTGACCTTCCTCTGCCTTTGTTC^ 

GAAGCTCACGCCTTAGTTTGAAACAGATTCTCCACGGTGGTCCCCAAAACACTGTCTGCATATCCATAAG 
AATTGAGCGCTATGGGTGTTAACGTGCATGAGGATCAGTTTGCAGCAGCAAGTACAAAAGGAGAAGAGGA 
ACATCCGTTGAATGAGTGTGTTTTGTACATAACTTCAGATACTTGTGAACATGCCTTATATTTGTCCAAC 
AACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 mRNA sequence - var4 (public gi: 163276D (seq id no= 205) 

CTGAACTAGTTGCCAGTGATCTTGAAACGTGACAGT^ 

ATTAGATGTAGTAAAAGAGAGTGTTTGAGAGCAGAAGCTATGGCAACTAAAGACTGGATTTGAATCCTTC 
CTAGCTTGGTGACATGAGCAAATTACTTGATTTAAGTGAGCATTTTCCCATCTO 

ATAATTGTGCCTGCTAAGAAGAATTGCTGTGAAGATTAGTGAAATAATGCATGTAAAACATTTGGTACAG 
TATGTGACACATAGTACAAATAGTTTGCTAGGAAGATTGTTATTATTCTTCACTTGTGATATTGTGAAGT 
TTTCATACAGCAAATTGGACATCATGAGATGGATTGATTAAATAAATAGATTTGAACTTCAAGGACTGGT 
AGTGTTCTTGCTTTGGAAAGAAGAAACTTGGTTTATCCTAATAATAGTAGGATAATAATGGTGAAGTGAT 
AGGTACAAGTAATAGTGTTTATGATGCGCTGGTGATGATAGGAAAAGAAAGCCATTATATGGGCAAGAGC 
TAGAAGTAATAAAATGGTGCATTTTTCAGTGATGTTTGGCCTATGTAGCTATTCTCTGATAACTATAAAA 

ATCCirATTATTGAAGATTCTTCAGGAAAAAAAAACCCITAGTCTGA^ 
CCCCCCATTGAAATACX3TATTTTTAAAACATC 

TAGCAGTGCTGATTGTGTATTGCAGTAGTTGTGAGAGCATTAGAAGCAGCAGTCGATAGGAGGATGGAAG 
GTCTGGATGCCGCCI^GGGGAGTTAGGAGATTGGCAGACTTACCCTGTACC^ 

GCCC^GACAGAAACACACTGAGATGGATAGGAGAATATGAGCAGTTGATAGGAAAGTTCTCAGTGGAGT 

CAGGATTTAGGTTAGGCCAGGAGATTGAGAATATAACAGTTTGTGTAT 

AATGCAGTAAAAGCAGGTAGGGTACAAGTGCAGCAACAGGAAGATGTCTTO 

TATTTGAGAGCTTACCATGTGCTAGGCACATACAAAGATAAATAAGATGCCCTTGATGA 

AAGGAGACATGTAAACAGGTTAACTTAGAGTAGAGATGGTGAATATGTGAACCTGAGGAAAGGAAGAAAT 

AGATTAAATTATCTGGAGAGAGAGGAAAAGTCAGCAGAATGGGGACGAGAATCTTTCGGAGOT 

CTGATAGGAGTTATTTCCTTGGGCATAGGTTCCAAGTATTTTTCTAATATACCATAGAA 

TTTCTTCTGTTATCTCAAATGATTTAATTACTGACTTGAGTTTGTGTTGTCT 
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TGGACAATTTTGCTGAGGGAGATTTCACTGTGGCGGATTATGCCTTGTTAGAAGATTGCCCTCACGTGGA 

TGATTGTGTCTTTGCTGCTGAATTTATGAGCA^ 

GTCGGTGTGC^TATAAAGATTATATCCAAAGTGAGAGGAATTTG^ 

PTAGTAAACCAATTTCTGTCCTGCAAGATTATTGCGA 

G^TCAAC^TCAAAACAGTTCCGTAATATCACGATTGC^ 

-TCTGAGATAAATTTGAAGAAACTACAACATCTTG^ 

TTGCTAATGATTC^TTCCTTATTGGAGGCTTATTGAGAATTGG 

^^•ix^GAATTCTCTGAATTGGATAAAATATC 
^^GACAATTGTTGGCCTATGTTAAGTATTTTCTTTACTGAATACAAGTACCACATAACTAAAATTGTAATGG 

AAGACTGCAATTTGCTTGAAGAACTTAAAACTCAAAGTTGTATGGATTGTATAG 

GAAAATGAAAGGAAATGAAGAGTTTTCC^AAGAAAGATTTGATATAGC 

ATTGAATATAGACCTGAAAACTACCTTCTTTATGGTAACCGAGCTCTTTGTTTTCTTCGTACTGGACAGT 

rTAGAAATGCACTCGG^ . 
TTATTGTGATGCTCrTTCTATGCTGGGGGAAT^ 
CTCTGTAAAAATGACCCTGAGGGAATGAAGGATCTAAT^ 
AAGACCTACAAGGTCGAACAGCAAATAAGGATCCAATT^ 

TAGGAGTTTATCAGCACCTATATTTACTACTTCACTTAACTTTGTGGAGAAGGAAAGAGATTTCAGAAAA 
ATTAATCACGAAATGGCCAACX3GTGGTAATCAGAATCTAAAGGTGGCGGATGAGGCGTTGAAGGTAGATG 

ATTGTGACTGTCATCCTGAATTTTCACCACCATCAAGTCAGCCT 
TCGAAACAATGAATCAGAAAAGTTCAGTTCTAGTTCACGATTGACTTTACCAGCAG 
TTGGAGAAACAGTTTTCTAAATCTTCCAGAGCTGCACACGAGGATTTTGCTAATATAATGAA 
GAAGCTTAATTCAAGATGGCTATATGGCCTTATTGGAGCAGCGTTGCCGCAGCGCTGCACAGGCCTTTAC 

AGAGTTGCTGAACGGTTTAGATCCTCAAAAAATAAAGCAATTGAACCT 

GTCGTCTATGGACTTGCCATTTCTCTCCTTGGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACC 
AGTTTAAGAGGATTATTGAACACTACCCCAGTGAGGGCCTTGATTGCTTGGCCTACTGTGGAATTGGAAA 

AGTGTATTTGAAAAAAAACAGATTTCTAGAAGCTCTCAATCA 

CGTCTTCCTGGAGTGTTAACTTGGCCCACGAGTAATGTGATTATTGAAGAGTCTCAGCCACAAAAAATAA 

AGATGCTGTTAGAGAAATTTGTTGAAGAATGCAAGTTCCCTCCAGTGCCAGATGCCATTTGTTGCTATCA 

GAAGTGCCATGGATATTCTAAGATCCAGATATAC^TAACTGATCCAGACTTTAAGGGTTTTATACGCATC 

AGCTGTTGCCAGTACTGTAAAATAGAATTTCACATGAATTGCTGGAAGAAGTTAAAAACTACAACCTTTA 

ATGATAAAATTGACAAGGATTTTCTACAAGGAATATGTCTTACCCCTGACTGTGAAGG 

GATTATCATCTTCAGCAGTGGTGGTGAAGTTAAATGT 

CCTCCAAGACCTATTCTGAAACAGAAATGTTCTAGCCTAGAGAAACTAAGACTGAAAGAAGACAAAAAAT 

TGAAGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAATGGAGGAGGACTT 

AAGTAATCCACCCAAAAATGAAGAGCAGAAAGAAACTGTAGACAATGTTCAGCGTTGTCAGTTCCTTGAT 

GACAGAATTCTACAGTGTATAAAGCAGTATGCT<^ 
TTCTCAAAGAATTGCTTTCTTGGAAAGTTTTGAGCACAGAAGACT 

TTTTCTAAATGAAGC^GTGGACTATGTTATTCGCCACTTGATTCAAGAAAATAACAGAGTAAAGACAAGA 
ATATTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCC 

ATAGCTTTGGCTTAGATGCCACAGGAAOTTTCTTTTCTCGTTATGGAGCATCTCTTAA^ 

TAGTATCATGACTTTCCTCTGGAATGAGAAATATGGTCACAAACTAGACTCTATAGAAGGAAAGCAACTT 

GATTATTTCTCTGAGCCAGCATCATTGAAGGAAGCCCGTTGTTTAATATGGCTGCTAGAAGAACACAGAG 

ACAAGTTCCCAGCATTGCATAGTGCTTTAGATGAATTGTTTGATATAATGGACAGCCGCTGTACTGTG^ 

AAGGAAAG^GATAGTGGTGAAGCACCGTTTAGTTCAACCAAGGTGAAAAACAAAAGCAAGAAA 

CCAAAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAACTTCAGTAACT^ 

CTTCAAGTGAAGACCATAGCAATCGAAATTCAGATTCT 

GCAAGATGTAGAAGAATTCGAAGCTCTCTATGACCAACACAGTAACGAATATGTTGTCCGCAATAAGAAG 
CTATGGGACATGAACCCAAAAC^AAAATGTT 

ATGGTCCCTTGGACATGAGTAACAAGATGTTCTCTGCAGAATATGAGTTTTTCCCAGAAGA71ACTCGACA 

GATACTAGAAAAAGC^GGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGTTTTGTTGTG 

TGTATTGGACTGAAGAAGGTTGCATCACGGCTGAAGAAAAAAAGGAAGAAGAAAAAG^ 

TAGAAGAAATTTCAAAAGCAGGGGAGTATGTACGAGTTAAACTACAACTGAATCCAGCTGCTAGGGAATT 

TAAACCAGATGTAAAGTCTAAACCAGTGTCAGATTCATCTTC^GCACCAGCTTTTGAAAATGTGAAACCC 

AAACCTGTGTCTGCAAATTCTCCCAAGCCAGCTTGTGAAGATGTGAAGGCCAAACC^ 

CTTCTAGACAAGTTTCTGAGGATGGGCAACCCAAAGGGGTCTCT 

GGATGCAAATTAGAAGCGAGTCTCCTGTAATTCCCCCAAACCGGTTCTTGAGGATGTGAAA 
TGGGCTCAATCCCATTTGGTCACAGGATACTGTACGTATCTTCCTTTCCAGAGATT 
CACCGCCAGGATACATAAACGTGTTACCAGGTTTGCC^ 
CCTTTCTCCTGAATATC^GCTACCAAGATCAGTACC^ 

GATAAAAATGCTGCTGCCTATTTTGAGGGTCATGATTTGAATGCTGAGAATGTTGCT 
CCTCTGAAACACAGATCCTTGAGGGCTCTTTGGGAATATCTGTAAAGTCACACTGCAGCACAGGTGATGC 
TC7VTACAGTCCTGAGTGAGTCTAACAGAAATGATGAGCACTGTGGAAATTCTAACAACAAATGTGAAGTA 
ATTCCAGAAAGCACCAGTGCAGTAACAAAC^TTCCACACGTGCAGATC 
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A^TAATACACCAAGAAGTCAATACTGAGCCAT^^^ 
GACTGAAAAGGAGCACCAAGTATTACAAGACCA 

r^GACGGAATTAGATTGGTTCCTTCAAGA^ 

ACCCAGAAAAATGATGGAjyiGGAAAAGGAACATGAATTACATCTGGATCAG 
^OTACAAATGAGAAAATGAAAATAGAAGAGTATATAA^ 

TCAGAGAGCTGTGGCTGCAGAGGTATCCGTACTTGAAAACTGGAAGGAGAGTGAAGTGTATAAGCT 

A^TCGMTCACAAGCAGAAGCCTTTCTG^ 

CTGACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTTTCT 

AAAGTCTCAGTTTGAAGAACAAATTAAGGCAAT^^ 

CAGATTTCTGAGCTTTCATTTCCTGCCTGTAACACGGTTCATCCCGAGTTACTCCCTGAGTCTTCA 

ACGATGGCCAAGGGCTTGTGACTTCTGCAAGCGACGTGACTGGAAACCACGCAGCACT 

TAGTGTGTTCTCTGCTGGTGATTCCCCAGGGGAGGCTCCTTCTGCGCTGTTGCCAGGGCCACCCCCT^T 

CAGCCTGAAGCCACTCAGCTGACAGGGCGAAAACGGGCTGGCCAGGCAGCTCTGTCAGAAC 

TGGCTGATCGGAAGCAGCCTGTTCCTCCAGGACGTGCTGCGCGTTCAAGCCAGTCTCCAAAAAAGC 

CAATAGTATTATTGAGCACCTGTCAGTGGTATTCCCATGTTACAACAGGACTGAGCTTGCTGGTTTTATT 

AAAAAAGTGCGAAGCAAAAACAAGAACTCACTCTCAGGATTGAGTATTGATGAAATTGTCCAAAGAGT 

CAGAACACATTCTAGATGAACAGAAAAAGAAAAAGCCAAACCCAGGAAAGGACAAGAGGACTTATGAGCC 

CAGCTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCACCCTCGGTGGTTGTTGCACCATCACCCAAA 

ACCAAGGGGCAGAAAGCAGAAGATGTCCCTGTGAGGATTGCACTGGGTGCAAGTTCCTGTGAAATATGCC 

ACGAGGTGTTCAAATCAAAAAACGTGCGTGTGCTCAAATGTGGGCACAAGTATGACAAAGGGTGCTTTAA 

GCAGTGGCTTAAAGGGCAGAGCGCTTGCCCGGCCTGCCAGGGTCGTGATCTCCTGACAGAAGAGTCACCT 

TCTGGAAGAGGCTGGCCGAGTCAGAATCAGGAGCTGCCTTCCTGCTCTTCTAGGTAGTCACACTTCACTA 

AAGTGTCATCCACCAGTGTGTTGAATCCGAAGAATGACAATTTTCTACCACTGGTGTAAAAAACAAACAT 

TTGAAGACCCTTGTGCATTGTGTGTCACAAAGCTAAATACATGGAAATCGTTAATATCGCTGATATTAAG 

TAATTTCCCCACTCTGAGTGAATACTTTGATGATTC 

TCATGGGTCACrGGGGCTGCGCAGGGCTCTTTGAGGTGGGTGGCTTCTTTTGGAAAGTACTATGAACGTC 
TCGAAGCAGTATTCTAGTGATAAGAATTCTTAACATAGCCAAGCGCCCCACGTTTGTTCCCCACGTTTGT 
TCCCCTTTTCTGTTTGAAAAACCTGTTCTGGTAGCTCCAC^GAGAGATGATACTGACTTTTTAAATm 

TTACAAGAGTCTGTATTCCTGATATGCCTATATTTOT 

AGCAAAGTATATTTTAATAATTTATAGTTAATGTTAAAATATTGGCTGATTTAGACCAAAAGATTCAAAT 
CTCCTCTTTGTGAAATCCCATCTGCATTTGATTTTTTATTATTTTATGTTCCCCCGTTAGATTGTT^TTAA 
GTGTTTGCTTTTCATCTTTTATAGATGTAATCTGATTTTCAAAAATCATTAA^ 

GACTAAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTCCGTCATCCCAGCCCCCGGTTGGAGCCTGCTC 

TTTGAACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCTTCTGTGAGTCAGTCAGCGAGTGCTTGGGATC • 

CGCATGCAGCCGTGCTGAGCACACAACAGGCTGTGTGTGGAAATGGCCACCACCATTCTCCTTCCCCACC 

CCACCAGAAAAAGAGAAGCTGTGTCTTTAGACAACCCTGAGGTATCTGTGTTACAATCGTTCTGTGTTTG 

ATATTTGTGTAAAGTATGCATGCAGTCTTGTACTGTGACCTAAGAACAAAACTGTAACTGCATTAGAAAC 

CATGAAAAAATTAGATATTGTTTTGTGACTTTTAGACAGTGGTAAATATAGAACCATGAATTCTGGTCAC 

ATTC(^TTTCTCTCCAACATGAAGGATCAAAAAATGTTTTTCAATGTGTTCTTTGTTCCACTGGAAACTT 

AGAGTCATGAGTTTATGAGCTGATTTGGTCAC CTTCCTCTGC CTTTGTTCACTGTGAGTTCTGATGTCTT 

AGTGACTTAGTTCTTAGAAGCTCACGCCTTAGTTTGAAACAGAOT 

TCTGCATATCCATAAGAATTGAGCGCTATGGGTGTTAACGTGCATGAGGATCAGTTTGCAGCAGCAAGTA 
CAAAAGGAGAAGAGGAAGATCCGTTGAATGAGTGTGTTTTGTACATAACTTCAGATACTTGTGAACATG^ 
CTTATATTTGTCCAACAACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 inRNA sequence - var5 (public gi: 2959902) (seq id no: 206 

ATATAATGTGAGGGTTTTTTCCTTTTTGCGATTTAGCAGTGCTGATTGTGTATTGCAGTAGTTGTGAGAG 
CATTAGAAGCAGCAGTCGATAGGAGGATGGAAGGTCTGGATGCCGCCTTGGGGAGTTAGGAGATTGGCAG 
ACTTACCCTGTACCACTCTAGCCCTACTCCTTTGCCCAAGACAGAAACACACTGAGATGGATAGGAGAAT 
GTGAGCAGTTGATAGGAAAGTTCTCAGTGGAGTCAGGATTTAGGTTAGGCCAGGAGATTGAGAATATAAC 

AGTTTGTGTATGATGAAATGGCATATTTCACAGAATGCA 

TCAACAGCAAGATGTATTTTCGATGCCAGTTCAAC ATAAACATCT CAGT CTTAC CATGTGC 

TAGGCAACTATACAAAACAGATAAGATT^AGATGCACGATTGACGATCCTCTATGTAAAGGACGACATGTA 

CAATTCACGTGCTTAACTGAGAGTAGAGATTTGAAGAAACTAC 

TGGATTTGGCAAGGAAAGTTGCTAATGATTCATTCCTTATTGGAGGCTTATTGAGAATTGGTTGTAAAAT 
AGAAAATAAAATCTTGGCAATGGAAGAAGCTCTGAATTGGAT^ 

ACTAAATTAGGATCAATTGAC^TTGTTGGCCTATGTTAAGTATTTTCTTTACTGAATACAAGTACCACA 

TAACXAAAATTGTAATGGAAGACTGCAATTTGCTTGAAGAACTTAAAACTCAAAGTTG 

AGAGGAAGGAGGACTAATGAAAATGAAAGGAAATGAAGAGTTTTCCAAAGAAAGATTTGATATAGCTATT 

ATCTATTACACCAGAGCCATTGAATATAGACCTGAAAACTACCTTCTTTATGGTAACCGAGCTCTTTGTT 

TTCCTCX3TACTGGACAGTTTAGAAATGCACTCGGTGATGGAAAGAGAGCCACTATTCTGAAGAACACTTG 
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rrrAARGGGTCATTATCGTTATTGTGATGCTCTTTCTATGCTGGGGGAATATGACTGGGCCCTGCAAGCA 

^^aagccgaaaaccagtttaagaggattattgaacactaccccagtgagggccttga 
gcaagaaccttgatttatcgtcttcctggagtgttaacttggcca 

Human TTC3 mRNA sequence - var6 (public gi: 1304131) (seq id no : 207 

52532^^ 

a^gIgactgcttgttaaat^ 

a?gtgtgotaattttagatttcttagggaagcagaaatgaca^^ 

II™ctgataaacagcatacagaaagaggaaagaacttcatttcatatgtttgtttttgack3aaaa 

tc^^ttgaacttcctc^ttcagaggggcttgtgggacatottggttaagatcctgtagtagt^ 

IotSgg^agaagtcaagagatacaaccccgcctggaaggatttgggagtcttcagcat^g^t^ 

tggaagccattgtttactgcagtgcatatgagataatttaaactggtacatagataaacaotgaaaaaa 

tt™tagataggaatttaatgtgtatgagaaacataacttgcacatctaacctttgataatca^ 

^ggcaaaWcacggtgaagctat^ 

gtggataaattcaccc^ggagagagtaagagtgagaagaaaagagagttgacattgggtggcggggaat 
ggagaagaagaacccatgaaggaaactgaggaagagcagccagacgaat^ 

TGGTCTTTGGAGTCCAAATGAAGAGTTCTGAGGAGGAAGTGGTCCACAGCGTCAAftCTTGTGCAC^TGQ 

?caa1Stcaaaacagttccgtaatatcacgattg^^ 

GAGATAAATTTGAAGAAACTACAACATCTT^ 

ctaatgattcattccttattggaggcttattgagaattggttgtaaaatagaaaataaaatcttggcaat 

ggaagaagctctxsaattggataaaatatgcaggcgatgtaacaattctaactaaattagga^ 

aattgttggcctatgttaagtattttctttactgaatacaagtaccacataactaaaattgtaatggaag 

actgcaatttgcttgaagaacttaaaactcaaagttgtatgga 

aatgaaaggaaatgaagagttttccaaagaaagatttgatatagctattatctattacaccagagccatt 
gaatatagacctgaaaactaccttctttatggtaaccgagctctttgttttcttcgtactggacagttta 
gaaatgcactcggtgatggaaagagagccactattctgaagaacacttggccaaagggtcattatcgtta 
ttgtgatgctctttctatgctgggggaatatgactgggccctgcaagcaaacataaaagctcaaaaactc 
tgtaaaaatgaccctgagggaatcaaggatctaattcagcagc^tgtaaagttacaaaaacaaatagaag 
acctacaaggtcgaacagcaaataaggatcc^ttaaagccttttatgaaaacagggcctacacacctag 
gagtttatcagcacctatatttactacttcacttaactttgtggagaaggaaagagatttcagaaaaatt 
aatcacgaaatggccaacggtggtaatcagaatctaaaggtggcggatgaggcgttgaaggtagatgatt 
gtgactgtcatcctgaattttcaccaccatcaagtcagcctccaaaacataaaggaaaacaaaaatctcg 

AAACAATGAATCAGAAAAGTTCAG1TCTAGTTCACCATTGACTTTACCAGCAGATTTGAAGAACATCTTG 

gagaaac^gttttctaaatcttccagagctgcacaccaggattttgctaatataatgaaaatgctgagaa 

GCTTAATTCAAGATGGCTATATGGCCTTATTGGAGCAGCGTTGCCGCAGCGCTGCACAGGCCTTTACAGA 

gttgctgaacggtttagatcctcaaaaaataaagcaattgaacctggccatgattaactatgttttggtc 

GTCTATGGACTTGCCATTTCTCTCCTTGGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACCAGT 

ttaagaggattattgaacactaccccagtgagggccttgattgcttggcctactgtggaattggaaaagt 
gtatttgaaaaaaaacagatttctagaagctctcaatcactttgagaaagcaagaaccttgatttatcgt 

CTTCCTGGAGTGTTAACTTGGCCC^CGAGTAATGTGATTATTGAAGAGTCTCAGCCACAAAAAATAAAGA 
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TGCTGTTAGAGAAATTTGTTGAAGAATGC^^ 
TGTTGCCAGTACTGTAAAATAGAATTTCACAT 

TATCATCTTC^GCAGTGGTGGTGAAGTTAAATGTGAAT^^ 

"XICAAGAGCTATTCTGAAACAGAAATGTTCTAGC 
AGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAATGGAGGAGG 

^^^AATCCACCCAAAAATGAAGAGCAGAAAGAAACTGT 
AGAATTCTACAGTGTATAAAGCAGT 

TCAAAGAATTGCTTTCTTGGAAAGTTTTGAGCAC^GAAGACTATACAACCTGTTTOT 
TCTAAATGAAGCAGTGGACT^TGTTATTOSCCACT 

TTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCCCAAATTAGCCGCCTGGATCCAAAAACTTAATA 
GCTTTGGCTTAGATGCCAGAGGAACTTTCTTTTCTCG - 
TATCMGACTTTCCTCT^^ 

TATTTCTCTGAGCCAGCATCATTGAAGGAAG C CCGTTGTTTAATATGGCTG CT AGAAGAAC AGAGAGACA 

AGTTCCCAGCATTGCATAGTGCTTTAGATGAATTCTTTGATA 

GAAACAAGATAGTGGTGAAGCACCGTTTAGTTCAACCAAGGTC 

AAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAAOTTCAGTAACTTGAAATAATGAGATCATC^ 

CAAGTGAAGACCATAGCAATCGAAATTCAGATTCTGC^GGCCCATTTGCAGTGCCTGAC 

AGATGTAGAAGAATTCGAAGCTCTCTATGACCAACACAGTAACGAATATGTTGTCCGCAATAAGAAGCTA 

TGGGACATGAACCCAAAACA7VAAATGTTCAACTCTATATGATTACTTCTCTCAGTTTTTGGAGGAACATG 

GTCCCTTGGACATGAGTAACJ\AGATGTTCTCTGCAGAATATGAGTTTTTCCCAGAAGAAACTCGACAGA^ 

ACTAGAAAAAGC7VGGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGTTTTGTTGTGATTGACAACTGT 

ATTGC ACTGAAGAAGGTTG CAT CACGGCTCAAGAAAAAAAGGAAGAAGAAAAACATT AAAACAAAAGT AG 

AAGAAATTTCAAAAGCAGGGGAGTATGTACGAGTTAAACTACAACTGAATCCAGCTGCTAGGGAATTTAA 

ACCAGATGTAAAGTCTAAACCAGTGTCAGATTCATCTTCAGCACCAGCTTTTGAAAATGTGAAACCC^^ 

CCTGTGTCTGCAAATTCTCCCAAGCCAGCTTGTGAAGATGTGAAGGCCAAACCAGTATCCGACAATTCTT 

<:tagacaagtttctgaggatgggcaacccaaaggggtctcttctaattctcctaaaccaggctctgagga 

TGCAAATTACAAGCGAGTCTCCTGTAATTCCCCCAAACCGGTTCTTGAGGATGTGAAACCAACTTATTGG 

GCTCAATCCCATTTGGTCACAGGATACTGTACGTATCTT 

CGCCAGCATACATAAACGTGTTACCAGGTTTGCCCCAGTACAC^ 

TTCTCCTGAATATCAGCTACCAAGATCAGTACCAGTGGTGCCGTC 

AAAAATGCTGCTGCCTATTTTGAGGGTCATCATTTGAATGCTGAGAATGTTGCTGGTCACCAGATTGCCT 
CTGAAACACAGATCCTTGAGGGCTCTTTGGGAATATCTO 

TACAGTCCTGAGTGAGTCTAACAGAAATGATGAGCACTGTGGAAATTCTAACAACAAATGTGAAGTAATT 

CCAGAAAGCACCAGTGCAGTAACAAACATTCCACACGTGCAGATGGTTGCCATACAGGTATCTTGGAACA 

TAATACACCAAGAAGTCAATACTGAGCCATATAATCCTTTTGAGGAACGACAAGGGGAAATTTCACGGAT 

TGAAAAGGAGCACCAAGTATTACAAGAC(^CTTCAAGAAGTGTATGAAAATTATGAGCAGATAAAACTT 

AAGGGCTTAGAAGAGACCAGGGACCTGGAAGAGAAGTTGAAAAGGCACTTAGAAGAAAACAAGATCTCAA 

AGACGGAATTAGATTGGTTCCTTCAAGATTTGGAAAGAGAAATTAAAAAATGGCAACAGGAAAAAAAAGA 

AATCCAAGAAAGACTAAAATCACTGAAGAAGAAAATTAAAAAGGTTTCAAATGCCAGTGAAATGTATACC 

CAGAAAAATGATGGAAAGGAAAAGGAACATGAATTACATCTGGAT CAGT CCCTTGAAATCAGC AACACAC 

TTACAAATGAGAAAATGAAAATAGAAGAGTATATAAAGAAAGGGAAAGAGGATTATGAAGAGAGTCATCA 

GAGAGCTGTGGCTGCAGAGGTATCCGTACTTGAAAACTGGAAGGAGAGTGAAGTGTATAAGCTACAGATC 

ATGGAGTCACAAGCAGAAGCCTTTCTGAAGAAGCTGGGGCTGATTAGCCGTGATCCTGCAGCATATCCTG 

ACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTTTCTAATGTTACAAAAGAAAT 

GTCTCAGTTTGAAGAACAAATTAAGGCAATTAAAAATGGl^CTCGGCTCAGTGAACTTTCTAAAGTGCA 

ATTTCTGAGCTTTCATTTCCTGCCTGTAACACGGTTCATCCCGAGTTACTCC 

ATGGCCAAGGGCTTGTGACTTCTGCAAGCGACGTGACTGGAAACCACGCAGCACTTCACAGGGATCCTAG 

TGTGTTCTCTGCTGGTGATTCCCCAGGGGAGGCTCCTTCTGCGCTGTTGCCAGGGCCACCCCCTGGTCAG 

CCTGAAGCC^CTCAGCTGAC^GGGCCAAAACGGGCTGGCC^GGCAGCTCTGTCAGAACGAAGCCCTGTGG 

CTGATCGGAAGCAGCCTGTTCCTCGAGGACGTGCTGCGCGTTCAAGCCAGTCTCCA^ 

TAGTATTATTGAGCACCTGTCAGTGGTATTCCCATGTTACAACAGCACTGAGCTTGCTGGTTTTATTAAA 

AAAGTGCGAAGCAAAAACAAGAACTCACTCTCAGGATTGAGTATTGATGAAATT 

AACACATTCTAGATGAACAGAAAAAGAAAAAGCCAAACCCAGGAAAGGA 

CTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCACCCT 

AAGGGGCTVGAAAGCAGAAGATGTCCCTGTGAGGATTGC^CTGGGTGCAAGTTCCTGTGAAATATGCCACG 

AGGTGTTCAAATCAAAAAACGTGCGTGTGCTCAAATGTGGGCACAAGTATCACAAAGGG 

GTGGCITAAAGGG<^GAGCGCrTGCCCGGCCTGCCAGGGTCGTGATCTCCTGAC^ 

GGAAGAGGCTGGCCCAGTCAGAATCAGGAGCTGCCTTCCTGCTCTTCTAGGTAGTCACACTTCACTAAA^ 

TGTC^TCCACCAGTGTGTTGAATCCGAAGAATGAC^ 
AAGACCCTTGTGCATTGTGTGTCACAAAGCTAAATACATG^ 

TTTCCCCACTCTGAGTGAATACTTTGATGATTGCCAACAGTGGCTAATAAAATGACGGCTACCACACTCA 
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TGGGTCACTGGGCTGCGCAGGGCTCTTTGAGGTGGGTGGCTTCCT 
AGCAGTATTCTAGTGATAAGAATTCTTAACAT^^ 

CTTTTCTGTTTGAAAAACCTGTTCTGGTAGCTCCACAAGAGAGATGATACTGACTTTTTAAA 

AAGAGTCTGTATTCCTGATATGCCTATATT 

AACTATATTTTAATAATTTATAGTT^ 

TCTTTGTGAAATCCCATCTGCATTTG^ 

TTGCTTTTCATCTTTTATAGATGTAATCTGATTTTCAAAAATCATTAACACTTTTTAATTAGTATCG 

AAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTCCGTCATCCCAGCCCCCGGTTGGAGCOTGCTCTTTG 

AACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCI^CTGTGAGTCAGTCAGCGAGTGCTTGGGATCCG^ 

TCCAGCCGTGCTGAGCACACAACAGGCTGTGTC^ 

CACAAAAAGAGAAGCTGTGTCTTTAGACAACCCTGAGGT^ 

TTGTGTAAAGTATGGATGCAGTCTTGTACTGTGACCT^ 

AAAAAATTAGATATTGTTTTGTGACTTTTAGACAGTGGTAAATATAGAACCATGAATTCTGGTCACATTC 
CATTTCTCTCC^CATGAAGGATCAAAAAATGTTT 

GTCATGAGTTTATGAGGCTGGATTTGGGCZACCTTTCCTTTGCCTTTGGTTCACTGTGAGTTCTGATGTCC 

TAGTGACTTAGGTCTTAGAAGCTCACGCCTTAGTTT^ 

GTCTGGATATCCATAAGAATTGAACGCTATGGGTC 

ACAAAAGGAGAAGAGGAAC^TCCGTTGAATGAGTGTGTTTTO 

CCTTATATTTGTCCAACAACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 Protein sequence - varl (public gi: 2662364) (seq id no: 308) 

IKINIFWPLLFQHQNSSVISRLHPCVDANNSRAS^^ 

GCKI ENKI LAMEEALNWI KYAGDVTI LTKLGS IDNCWPMLS I FFTEYKYHITKIVMEDCNLLEELKTQSC 
MDCIEEGELMKMKGNEEFSKERFDIAIIYYTRAIEYRPENYLIjYGNRALCFLRTGQFRNALGDGKRATIL 
KNTWPKGHYRYCDALSMIjGE YDWALQANI KAQKLCKNDPEG I KDL I QQHVKLQ KQI EDLQGRTANKDPI K 
AFYENRAYTPRSLSAPIFTTSLNFVEKERDFRKINHEMANGGNQNLKVADEALKVDDCT 
PPKHKGKQKSRNNESEKFSSSSPLTLPADLKNIIiEKQFSKSSRAAHQDFANIMKI^R^ 
^CRSAAQAFTEIiLNGLDPQKIKQIiNIxfiMINYVLVVYGIjA^ SLLGI GQPEELSEAENQFKRI I EHYPSEGL 
DCLAYCGIGKVYLKKNRFLEALNHFEKARTLIYRLPGVLTWPTSW 
PVPDAICCYQKCHGYSKIQIYITDPDFKGFIRISCCQYCKIEFH^ 

TPDCEGVISKI I IFSSGGEVKCEFEHKVIKEKVPPRPILKQKCSSLEKLRLKEDKKLKRKIQKKEAIOaA 
QERMEEDLRESNPPKNEEQKETVDNVQRCQFLDDRILQCI KQYADKI KSGIQNTATLLKELLSWKVLSTE 
DYTTCFS SRNFLNEAVDYVTRHLI QENNRVKTRI FLHVLS ELKE VEPKLAAWI QKLNS FGLDATGTFFSR 
YGASLKLLDFSIMTFLWNEKYGHKLDSIEGKQLDYFSEPASLKEAR^^ 
DIMDSRCIVLRKQDSGEAPFSSTKVTCNKSKKKXPKDSKPMLVGSGOT 
GPFAVPDHLRQDVEEFEALYDQHSNEYVVRNKKLM 

YEFFPEETRQILEKAGGLKPFLLGCPRFWIDNCIAIiKKVASRLKKKRKK^ 

LQLNPAAREFKPDVKS KPVSDS SSAPAFENVKPKPVSANS PKPACEDVKAKPVSDNS SRQVS EDGQPKGV 
S SNS PKPGSEDANYKRVS CNSP KPVLEDVKPTYWAQSHLVTGYCTYLPFQRFDI TQTP PAY INVLPGLPQ 
YTSIYTPLASLSPEYQLPRSVPWPSFVANDRADKNAAAYFEGHHLNAENVAGHQIASETQILEGSLGIS 
VKSHCSTGDAHTVIiSESNRNDEHCGNSNNKCEVIPESTSAVTNI PHVQMVAIQVSWNI IHQEVNTEPYNP 
FEERQGE I SRI EKE2TOVLQDQLQEVYENYEQI KLKGLEETRDLEEKLKRHLEENKI S KTELDWFLQDLER 
EIKKOTQQEKKEIQERLKSLKKKIKKVSNASEMYTQKNEK3I^ 
KGKEDYEESHQRAVAAEVSVLENWKESEVYKLQIMESQAEAFL^ 

SNVTKVIEKAKSQFEEQI KAI KNGSRLSELSKVQI SELSFPACNTVHPELLPES SGDDGQGLVTSASDVT 
GNHAALHRDPSVFS AGDS PGEAPS ALLPG P PPGQPE ATQLTGPKRAGQAALS ers P VTDRKQPVPPGRAA 
RS SQSPKKPFNS 1 1 EHLS WFPCYNSTELAGFI KKVRSKNKNSLSGLS I DE I VQRVTEHI LDEQKKKKPN 
PGKDKRTYEPSSATPVTRSSQGSPSWVAPS PKTKGQKAEDVPVRIALGASSCEI CHEVFKSKNVRVLKC 
GHKYHKGCFKQWIiKGQSACPACQGRDLLTEESPSGRGWPSQNQELPSCSSR 

Human TTC3 Protein sequence - var2 (public gi: 1632766) (seq id no: 309) 

MLGE YDWALQANI KAQ KLCKND PEGI KDL I QQHVKLQKQ I EDLQGRTANKDP I KAFYENRAYTPRSLS AP 

I FTTSLNFVEKERDFRKINHEMANGGNQNLKVADEAIiKVDDCDCHPEFS PPSSQPPKHKGKQKSRNNESE 

KFS SSS PIiTLPADLKNI LEKQFSKS SRAAHQD FANIMKMLRSLI QDGYMALLEQRCRSAAQAPTELLNGL 

DPQKIKQLNLAMINYVLVVYGLAISLLGIGQPEELSEA^ 

RFLEALNHFEKARTLIYRLPGVLTWPTSOTIIEESQPQKIKMLLEK^ 

KIQIYITDPDFKGFIRISCCQYCKIEFHMNCWKKIiKTTTFNDKIDKDFLQG^ 

GGEVKCEFEHKVIKEKVPPRPILKQKCSSLEKLRLKED 

EEQKETVDNVQRCQFLDDRI LQCI KQYADKI KSGIQNTAMLLKEIjLSWKVIjSTEDYTTCFSSRNFIiNEAV 
DYVIRHLIQENNRVKTRI FLHVLS ELKEVE PKLAAWI QKI^SFGLDATGTFFSRYGASLKIiLDFS IMTFL 
WNEKYGHKLDSIEGKQLDYFSEPASLKEARCLIWLLEEHRDK^ 
EAPFSSTOTKNKSKKKKPKDSKPMLVGSGTTSVTSNNEI 
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EALYDQHSNEYVVRNKKLWDMNPKQKCSTLYDYFSQFLEEHGPLDMSN^ 

GLKP FLiLG C PRFW I DNC IJVLKKV ASRLKKKRKKKN I KTKVE E I S KAGE YVRVKLiQLNP AARE FKPDV KS 
KPVSDSSSAPAFENVKPKPVSANSPKPACEDVKAKPVSDNSSRQVSEDGQ 

VSCNSPKPVLEDVKPTYWAQSHLVTGYCT YLPFQRFDITQTP PAYINVLPGLPQYTS I YTPLASLS PE YQ 
LPRSVPVVPSFVANDRADKNAAAYFEGHHLNAENVAGHQIASE 

SNRNDEHCGNSNNKCEVI PESTSAVTNI PHVQMVAIQVSWNIIHQEVNTEPYNPFEERQGEISRIEKEHQ 
VLQDQLQEVYENYEQI KLKGLEETRDLEEKLKRHtiEENKI S KTELDWFLQDLEREI KKWQQEKKE I QERL 
KSLKKKIKKVSNASEMYTQKNDGKEKEHELHLDQSLEI SNTLTNEKMKIEEYI KKGKEDYEESHQRAVAA 
EVSVLENWKESEVYKLQII^SQAEAFLKKLGLISRDPAAYPDMESDI^ 
QIKAIKNGSRLSELSKVQISELSFPACNTVHPELLPESSGHDGQGLVTS^^ 

DSPGEAPSALLPGPPPGQPEAXQLTGPKRAGQAALSERSPVADRKQPVPPGRAARSSQSPKKPFNSIIEH 
LSVVFPCYNSTELAGFIKKVRSKNTOJSLSGLSIDEIV^ 

TRS SQGSPSVWAPS PKTKGQKAEDVPVRI ALGASSCEI CHEVFKS KNVRVLKCGHKYHKGCFKQWLKGQ 
SACPACQGRDLLTEESPSGRGWPSQNQELPSCSSR 

Human TTC3 Protein sequence - var3 (public gi: 1632764) (seq id no: 3io) 

MKMKGNEEFSKERFDIAIIYYTRAIEYRPENYLLYGNRALCFLRTGQFR^ 

RYCDALSMLGE YDWALQANI KAQKLCKNDPEGI KDL»I QQHVKLQKQ I EDLQGRTANKDP I KAFYENRAYT 
PRSLSAPIFTTSLNFVEKERDFRKINHEMANGGNQNLKVADEALKVDDCDCHPEFSPPSSQ 
SRNNESEKFSSSSPLTLPADLKNILEKQFSKSSRAAHQDFANIMKMLRSLIQDGYMALLEQR^ 
TELI^GLDPQKIKQLNLAMINYVLVVYGLAISLIX3 

KVYLKKNRFLEALNHFEKARTLIYRLPGVLTWPTSNVIIEESQPQKIKM^ 
QKCTGYSKIQI YITDPDFKGFIRI SCCQYCKIEFHMNCWKKLKTTTFI^KIDKD 
KIIIFSSGGEVKCEFEHKVIKEKVPPRPILKQKCSSLEKLRLKEDKKIiKRKIQKKEAKia^ 
ESNPPKNEEQKETVDNVQRCQFIJDDRILQCIKQYADKIKSGIQOT^ 

NFLNEAVDYVI RHLI QENNRVKTRI FLHVLSELK35TVEPKLAAWI QKLNS FGLDATGTFFSRYGASLKLLD 
FSIMTFLWNEKYGHKLDS I EGKQLDYFSEPASLKEARCLIWLLEEHRDKFPALHSALDEFFDIMDSRCTV 

'lrkqdsgeapfsstkvknkskkkkpkdskpmlvgsgttsvtsnneii^^ 

rqdveefealydqhsneyvvrnkklwdmnpkqkcstlydyfsqfleehgpldmsnkmfsaeyeffpeetr 
qilekagglkpfllgcprfwidncialkkvasrlkkjcrkkkniktkve 

fkpdvkskpvsdsssapafenvkpkpvsanspkpacedvkakpvsdnssrqvsedgqpkgvssnspkpgs 
edanykrvscnspkpvledvkjptywaqshlvtgyctylpfqrfditqtppayinvlpglpqyts^ 

SLSPEYQLPRSVPWPSFVANDRADKNAAAYFEGHHLNAENVAGHQIASETQILEGSLGISVKSH^ 

AHTVLSESNRNDEHCGNSNNKCEVI PESTSAVTNI PHVQMVAI QVSWNI IHQEVNTEPYNPFEERQGE I S 

RIEKEHQVLQDQLQEVYENYEQIKLKGLEETRDLEEKLKRHLEENKISKTELDWFLQDLEREI 

KEI QERIjKSLKKKI KKVSNAS EMYTQKNDGKEKEHELHLDQS LE I SNTLTNEKMKI EEY I KKGKED YEES 

HQRAVAAEVSVLENWKESEVYKLQIMESQAEAFLKKLGLI SRDPAAYPDMESDI RSWELFLSNVTKEI EK 

AKSQFEEQIKAIKNGSRLSELSKVQISELSFPAOSrTVHPELLPESSGHDGQGLVTS^ 

PSVFS AGDS PGEAPSALLPGPP PGQPEATQLTGPKRAGQAALSERS PVADRKQPVP PGRAARSSQS PKKP 

FNS 1 1 EHLS WFPCYNSTELAGFI KKVRS KNKNSLSGLS IDE I VQRVTEHI LDEQKKKKPNPGKDKRT YE 

PS S ATP VTRS SQGS PS VWAPS PKTKGQKAEDVPVRI ALGAS S CE I CHEVFKS KNVRVLKCGHKYHKGCF 

KQWLKGQSACPACQGRDLLTEESPSGRGWPSQNQELPSCSSR 

Human TTC3 Protein sequence - var4 (public gi: 1632762) (seq id no: 3ii) 

mdnfaegdftvadyalledcphvddotfaaefmsndyvrvtqlycdgvgvqykdyiq 

csk^isvlqdycdaikinifwpllfqhqnssvisrlhpcvdannsraseinlkx^ 

VANDSFLIGGLLRIGCKIENKSLAMEEALNWIKY^ 
EDCNLLEELKTQSC^nDCIEEGELMKMKGNEEFSK^ 

FRNALGDGKRATI LKNTWP KGHYRYCDALSMLGE YDWALQANI KAQKLCKNDPEGI KDL I QQHVKLQKQ I 
EDLQGRTANKDPIKAFYENRAYTPRSLSAPIFTTSLNFVEKERDFRKINHEMANGGNQNLKVAD^ 

DODCHPEFSPPSSQPPKHKGKQKSRNNESEKFSSSSPLTLPADLKNILEK^^ 
RSLIQDGYMALLEQRCRSAAQAFTELLNGLDPQKIKQLNI^ 

QFKRI I EHYPSEGLDCI^YCGIGKVYLKKNRFLEAIiNHFEKART^ I EESQPQKI 

KMLLEKFVEECKFPPVPDAICCTQKCHGYSKIQIYITDPDFKGFI^^ 

NDKIDKDFLQGICLTPDCEGVISKIIIFSSGGEVKCEFEHKVIKEKVPPRPILKQKCSSLEKLRLKEDK^ 

LKRKIQKKEAKKLAQERMEEDLRESNPPKNEEQKETVDNVQRCQFLDDRILQCIKQ 

LLKELLSWKVLSTEDYTTCFSSRNFLNEAVDYVIRHLIQENNRVKTRIF 

NSFGLDATGTFFSRYGASLKLLDFS IMTFLWNEKYGHKLDS IEGKQLDYFSEPASLKEARCLIWLLEEHR 
DKFPALHSALDEFFDIMDSRCTVLRKQDSGEAPFSSTKVKNKSKKK^ 

TSSEDHSNRNSDSAGPFAVPDHLRQDVEEFEALYDQHSNEYVVRNKKLWDMNPKQKCSTLYDYFSQFLEE 

HGPLDMSNKMFSAEYEFFPEETRQILEKAGGLKPFLLGCPRFVVIDNCIALKKVASRLKKKRK^ 

VE E I S KAGE YVRVKLQLNP AARE FKPDV KS KPVSDS S S AP AFENVKPKP VS ANS PKP ACEDVKAKPVSDN 
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SSRQVSEDGQPKGVSSNSPKPGSEDANYKRVSC^SPKPVLEDVKPT^ 
TPPAYINVIjPGLPQYTSIYTPIjASIiSPEYQLPRSVPVVPSFVANDRADKNA^ 

ASETQILEGSIjGI SVKSHCSTGDAHTVIiSESNRNDEHCGNSNNkCEVI PESTSAVTNI PHVQMVAIQVSW 

NIIHQEWTEPYNPFEERQGEISRIEKEHQVLQDQLQEVYENYEQIKIjKGLEETRDLEEK^ 
. SKTELDWFLQDLEREI KKWQQEKKEIQERLKSLKKKI KKVSNASEMYTQKNDGKEKEHELHIjDQSLEISN 
"tLTNEKMKIEE YI KKGKEDYEESHQRAVAAEVSVLENWKESEVYKLQIMESQAE AFLKKLGLI SRDPAAY 

PDMESDIRSWEIiFLSNVTKEIEKAKSQFEEQIKAIKNGSRLSEL^ 
^GQGLOTSASDVTGimAAIiHRDPSVFSAGDSP 

VADRKQPVPPGRAARSSQSPKKPFNSIIEHLSVVFPCYNSTEIiAGFIKKVRSKNKNSLSGLSIDEIV 

TEHIIiDEQKKKKPNPGKDKRTYEPSSA^ 

HEVFKS KNVRVLKCGHKYHKGCFKQWLKGQS ACPACQGRDLLTEES PSGRGWPSQNQELPS CS SR 

Human TTC3 Protein sequence - var5 (public gi: 2969903) (seq id no: 31*? 

DLKKLQHLEIJ^DIVDIjARKVANDSFLIGGLIjRIGCKI ENKILAMEEALNWI KYAGDVTILTKLGSIDNC 
WPMLSIFFTEYKYHITKIVMEDCNIjLEELKTQSCM^ 

RPENYTjLYGNRALCFPRTGQFRNAIjGDGKRATI LKNTWPKGHYRYCDALSMLGEYDWALQANI KAQKLCK 
NDPEGIKDIiIQQHVKLQKQI EDLQGRTANKDP I KAFYENRAYTPRSLSAPIFTTSIjNFVEKERDFRKINH 
EMANGGNQNLKVADEALKVDDOT^ 

QFSKSSRAAHQDFANIMKMLRSLIQIXSYMALLEQRCRSAAQAFTELLNGIiD 

GLAISLLGIGQPEELSEAENQFKRIIEHYPSEGLDCI^^ 

GVLTWP 

Unigene Name: UBE2 N Unigene ID: Hs. 458359 

Human UBE2N mRN A sequence - varl (public gi: 37577134) (SEQ ID NO: 208) 

^CGCGCGC^GTCGTOCGCGGGTCGTGCCGTACC^CCGTCGCGGGCAGGCTCGGCCACGAGCGCCAGAGC 
CCCGCGCCTCCCCTCGCGGCCTGTCCCAA 

CAGCGGTTGGGGCGGGGCGCACGGGGGAGGGGGCCAGGTCX3GAGGGAAGCCCGCCCGTGCCCGAGCCCGC 
GCCCGAGCAGGGACTACATTTCCCGAGGGGCCTCGGCGGCGGCTGCGGCGACGGGCGCGGCAACGTCCCC 
CGGAAGTGGAGCCCGGGACTTCCACTCGTGCGTGAGGCGAGAGGAGCCGGAGACGAGACCAGAGGCCGAA 
CTCGGGTTCTGACAAGATGGCCGGGCTGCCCCGCAGGATCATCAAGGAAACCCA 

CCAGTTCCTGGCATCAAAGCCGAACCAGATGAGAGCAACGCCCGTTATTTTCATGTGGTCATTGCTGGCC 
CTCAGGATTCCCCCTTTGAGGGAGGGACTTTTAAACTTGT^CTATTCCTTCCAGAAGAATACCCAATGGC 
AGCCCCTAAAGTACGTTTCATGACCAAAATTTA^ 

GATATTTTGAAAGATAAGTGGTCCCCAGC^CTGCAGATCCGCACAGTTCTGCTATCGATCCAGGCCTTGT 
TAAGTGCTCCCAATCCAGATGATCCATTAGC^^TGATGTAGCGGAGC^GTGGT^GACCAACGAAGCC^ 
AGCCATAGAAACAGCTAGAGCATGGACTAGGCTATATGCC^ 

AAGTGTGCATCACTTCTCCTGTTCTGCCAAGACTTCCTCCTCTTTGTTTGCATTTAATGGACACAGTCTT 
AGAAACATTACAGAATAAAAAAGCCCAGACATCTTCAGTCCTTTGGTGATTAAATGCACATTAGCAAATC 

TATGTCTTGTCCTGATTCACTGTCM^ 

ACGTTTAAAAGCAGTGGCCCCTCCCTGCTTTTATTC^TTTCCCCCATCCTGGTTTAAGTATAAAGCACTG 
TGAATGAAGGTAGTTGTCAGGTTAGCTGCAGGGGTGTGGGTGTT^ 

TTTGAGGGGGGAGGTAGTTTAATTTTATGGGCTCCTTTCCCCCTTTTTTGGTGATCTAATTGCATTGGTT 
AAAAGCAGCTAACCAGGTCTTTAGAATATGCTCTAGCCAAGTCTAACTTTATTTAGACGCTGTAGATGGA 
CAAGCTTGATTGTTGGAACCAAAATGGGAACATTAAACAAAC^ 
TGAAGTGTAGATTCCCCCCTTCAAAAAAAGCT^ 

AATCTTATGTTTTAGTAAAATATTTTTTGTTATTCTACTTTGCCTTTGTACAGTTTA 

ATTTCATTTTCCCAATTTGACAATCGTATTTTAAAAT 

CCATCTGACAAATTGAATGGCAAGAGGTGGATTTTGCC^^ 

GATAGTACTGAATGGAGTATTTATAAACTGGCCCTGAGCATGCATAAAGCATCAGTATCTGACCTTTTTT 

TAACCTTCTAGGAATTTGAAATAAATGTGTTTGTGTTGTCTGATTAGATGATCATTGGTGTCTTGCCACA 

ATGTTTAAAAATTACTGTACAGGAAAGTCACAGCAAAGATAG 

GM3TTGTGCGACATTTTTGCCTAAAACT 

CTGTAACCTTTCATGTGTGTTAAGAAACACT^ 

TTCCTTATAAACCTGATGAGCAGATCTCy^CT^ . 
AGAGTATCAAGTCCTCAGTTAATGATTCTTGCTTTCATCCCTCCAGTATTTGCTGTGGGAGCTCGTTTT^ 

TTCTTTAATTTGGAATTCAGTAATT^ 

CCCACCTCTCTCC^TATCTAATTCCTGATTCTTGTTATTTTTAAGTCATAAATGTAGCCAG 

CATAAATGTTAACCTTCGGGTTGCAACCTTGTCTCTTGCAGTTTAAGGTAATG 

GAATTTTCTTCACTCTTATTCTCGTAATTCTGGAGTTTCTTCAGATTGT 

ATGTAAGATGAAGAATTAACTATTAAAATTACATTTTCAACATACAAAAGCTTTT 

GTATCCTTCCAAATAAATGCATTGCTTGGTAAAAAAAAAAAAAAAAAA 
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Human UBE2N protein sequence - varl (public gi: 4507793) (seq id no: 313) 

MAGLPRRIIKETQRIiIiAEPVPGIKAEPDESN^ 

FMTKI YHPNVDKLGRICLDI LKDKWS PALQIRTVLLS I QAIiLS APNPDDPLANDYAEQWKTNEAQAI ETA 
RAWTRLYAMNNI 

Human UBE2N pray sequence - varl (SEQ ID NO: 209) 

(XICGCCATGGNGTACCCATACGACGTACCAG^ 
C^U^GCAGTGGTATCAACGCAGAGTGGCCGAACTCGGGTTCT 
CATCAAGGAAACCCAGCGTTTGCTGGCAGAACCAGTTCCTGGCA 
GCCCGTTATTTTCATGTGGTCATTGCTGGCCCTCAGGATTCC 

AACTATTCCTTCCAGAAGAATACCCAATGGC^GCCCCTAAAATAAGTGGTCCCCAGCACTGCA 

ACAGTTCTGCTATCGATCCAGGCCTTGTTAAGTGCTCCCAATCCAGATGATCCATTAGCAAATGATGTAG 

CGGAGGAGTGGAAGACCAACGAAGCCCAAGCCATAGAAACAGCT^ 

GAATAATATTTAAATTGATACGATCATG^GTGTGCA^ 

TTTGTTTGCATTTAATGGACAGAGTCTTAGAAACATTAC^ 

TNGGTGATTAAATGCAC^TTANCAAATOT 

AAATTTNATCTGGATNNTNGGAAACNTTNAAAACNN^ 

•NTNAAOTTAAACCCNGGAATNANGGNNTTTNCNGNNACNNNGGGGGT 



Human UBE2N pray sequence - var2 (SEQ ID NO: 210) 

CGAGCGCCGCCTGGNOTACCCATACGACGTACCAGNATTACGCTCATATGGCCATGGNAGGCC^ 
TCCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGCCGGGGGAGAGGAGCCGGAGACGAGACCA 

GAGGCCGAACTCGGGTTCTGACStfVGATGGCC^GGCTO 

CTGGCAGAACCAGTTCCTGGCATCAAAGCCGAACCAGATGAGAGCAACGCCCGTTATTTTCATGTGGTCA 

TTGCTGGCCCTCAGGATTCCCCCTTTGAGGGAGGGACTTTTAAACTTGAACTATTCCTTCCAGAAGAATA 

CCCAATGGC^GCCCCTAAAGTACGTTTCATGACGAAAATTTATCATCCTAATGTAGACAAGTTGGGAAGA 

ATATGTTTAGATATTTTGAAAGATAAGTGGTCCCCAGCACTGCAGATCCGCACAGTTCTGCTATCGATCC 

AGGCCTTGTTAAGTGCTCCCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTGGAAGACCAA 

CGAAGCCCAAGCCATAGAAACAGCTAGAGCATGGACTAGGCTATATGCCATGAATAATATTTAAATTGAT 

ACGATGATCAAGTGTGCATCACTTCTCCT 

CAGA.GTCTTANAAACNTTNNGAATAAAAANCCCANA 

NNNTTNTNTTGNCNGNTNCCTGNNTAAACCTGNCCNAGNC™ 

ANNNTGNCCCCCmiNTTTTTTNTTNC^ 

GGGNGGGNTTTTTTTTTTTTTTTTTNTN 



Unigene Name; UNC84B Unigene ID; Hs. 406612 

Human UNC84B mRNA sequence - varl (public gi: 31742497) (SEQ ID NO: 21 1) 

CCGCCCGCCCGCCTTGTCCCGCGTCGCCGCTCTTCGCGTGCCCGCGCGCCCCGGGCCCGGCCGCTGTGTC 
GCCCTGAGCGGAGCGCCGCCCXSCX^GATCCCCACGCGGAA 
CGGACGCCCCCGGCCGGGCTAGAAGCCGCCGCGGCAGCAGATTCT 
CCTCATCATGTCCCGAAGAAGCCAGCGCCTCACGC^ 

AGCGGAGGGAGCTCGGTGGCTGGGAGTCAGAGCACCCTGTTTAAAGACAGTCCTCTCAGGACCTTGAAGA 
GGAAATCCAGCAACATGAAGCGCCTGTCCCCAGCGCCACAGCTGGGCCCGTCCTCTGATGCACACACCTC 

CTACTACAGTGAGTCGCTGGTCCACGAGTCCTGGTTCCCACCC^ 

GACGCCAACTGGGGTGAGGACCTGCGGGTGCGGAGGAGGAGAGGCACGGGTGGCTCAGAGAGCAGCAGGG 
CGAGCGGGCTTGTGGGGCGCAAGGCCACCGAGGACTTCC^ 

CGACTACGTGGGCTACTCGGATGTGGACC^GCAGAGTTCCAGCTCGCGGCTCCGAAGCGCCGTOTC^ 

GCGGGCTCCTTACTCTGGATGGTGGCCACIT^ 

GGAGCACCTGGTACCGCCTGACC^CAGCTCC 

GTCCCTGAAGACGTTCCTCTGGTTCCTGCTGCCGCTGCTCTTGCTGACGTGCCTGACGTATGGTGCTTGG 
TATTTCTACCCCTATGGGCTGCAGACATTCCACCCTGCT 

GGAGGCCGGATGAGGGCTGGGAAGCCAGAGACTCATCGCCACATTTCCAGGCTGAGC^ 
CCGGGTACACTCTCTGGAGCGGCGTCTGGAAGCTOT 

GCCATGCGGCTGGAACGTCTGGAGCTGCGGCAAGGGGCTCCTGGCCAGGGAGGTGGTGG 
ACGAGGACACCCTGGCGCTGCTGGAGGGGCTAGTGAGCCGCCGTGAAGCTGCCCTGAAGGAGGATTTCCG 
CAGGGAAACTGCTGCTCGCATCCAGGAAGAACTGTCTGCCCTGAGAGCAGAGCATCAGGAA 
GACCTCTTCAAGAAGATCGTCCGGGCCTCCCAGGAGTCCGAGGCT 
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OOCAAAGCATGACCCAGGAGTCCTTCCAGGAGAGCTCTGTGAAGGAGCTGAGGCGGCTGGAGGACCAGCT 
GgI^SSg^cS 

TTGCCCGAGGTGGAGGGGGCCGCGTGGGGCTCCTTCAGAGAGAGGAGATGCAAGCTCAGCTGCGAGAGCT 

gSSgcaagatcotcacccatgtggcagagatgcaggg^ 

agcSg^ctgcagSagaaggtgtgattggagtgacagaggagcaggtgcacca^ 

ccctgcagcgctacagtgaggaccgcatcgggctggcagactacgccctggagtc^ggaggg^ 

IggggcScIaggcSSccgtcsgtccgcct 

^cSa^ccttgtcacccaacagcactatctccagtgcccccaa^^ 
gacgaagacctgcagcaggaggggacactccttggcaagttcacttacgatcaggacggcgagcctattc 

agISg^cactScaS^ 
g^ca^gagtacacctgcat^^ 

gcgctctgacttctaggagcacargagaggagcctgtggccccatgcagatgaaaaggacgggcagggtc 

TCraAGCAGCAGGTGGCTCGAGG^^ 
GTGGCGTCTGCTTrc^ 

AGGTAGGACCTGACTGGTCTTGGCTGGGGTCAGGGGCTGGTGCCTGGGAGCTGATGAAGCAGGTGCCAGG 
CTCTGGGAGGAGGGTGCTTAGCTCCAGAGCAGGATGGGACTTCC^ 

Sgcccagc?ctcctacaaggttagtgccctccacctagggaagcatgaaccac^^ 

TTCGACAAAAGTGTGTATTTGTCCCGGGGAGGGTAGCAGTGGGCCATGGGGCTTCTTGTOCCCT^ 
GA^GCTGCTGTGATCTTCTAAGGGGCCC^GGGCCAACCCTGTAGGCTTCCCCTCTGCTGGGGACGGTA 

gcSctgagcttgctttccacattctcccagggtatggagacctagacctgtctttggggccattag^ 
ctggggttatagcaagaagagtcsgggagcatggaactcctgggctcttgtggggacgttcagggtatcgg 

•Stocgaggtctgtctgcacc^^ 
gcccccttgggctcttgcagctcttgttcagg™^^ 

tgggggctgggggaagaggctggcagaagttaccctggatagggaagggggaggaggggaotttagagc 
cagcaggccccactgtattatgtatatatttttcaaggtctgtttttctaactgaaaagctaagggcttg 
attcctagccccgttctgtggggcactgggtgatactcagtttcttgttcctggccgtggagacmggcct 
ggggcactggttccggctgtgtctggtggtctggctgcggggaaggggcaagaaggcgggcaggcotca 
ctgcagcactgagcctcaaatccgctctggagcatgaggctggatgcagtggtggtgaggccgccccgct 
ccatcccgaggcagccagggtttgttttgcgctctcctgtcacaaatgctgcactattggttcttaagtt 
ttttatctccagatcctaatttatgcctatgcaaaaaataaatgacgcccaagagctg 

Human UNC84B protein sequence - varl (public gi: 3174249a) <seq ^ no: 314) 
msrrsqrltrysogdddgssssggssvagsqstlpkdsplrtlioucssnmkrlspapqi^pssdaotsyy 

seslvheswfpprssi^ei^gdanwgedlrvrrrrgt^ 

vgysdvdqqssssrlrsavsragsllwmvatspgrlfrllywagttwyrlttaaslldvfvltrr^ 

KTFLWFLDPLLI.LTCLTYGAWYFYPYGr.QTFHPALVSWWAAKDSRRPDEGWEARDSSPHFQAEQRVMSRV 
HSLERRLEALAAEFSSNWQKEAMRIjERIiELRQGAPGQXSGGGGIjSHEDT^ 

TAARIQEELSALRAEHQQDSFJDLFKKIVRASQESEARICffiLKSEWQSMTQESFQESSVKELRRLE^LAG 
LQQELAAIjAIjKQSSVAEEVGLIiPQQIQAVRDDVESQFPAWrSQFIiARGGG^ 

KIIiTHVAEMQGKSAREAAASLSLTLQKEGVIGVTEEQVHHIVKQAMRYSFJ3RIGI»ADYAIjESGGASVIS 

trcsetyetktallslfgipi.wyhsqsprvilqpdvhpgnotafqgpogfavvrlsarirptavtmhvp 
kalspnstissapkdfaifgfdedlqqegtllgkftydqdgepiqtfhfqaptmatyqwelriltnwgh 

peytciyrfrvhgepah 

Human UNC84B pray sequence - varl (SEQ ID NO: 212) „ m ^ AW , 
gatttggnaatnctacagggnatgtitaataccactacaatggatgatgtatataactatctattcgato 
atgaagataccccaccaaacccaaaaaaagagatctttaatacgactcgactatagggcgagcgccgcca 
tggagtacccatacgacgtaccagattacgctcatatggccatggaggccagtgaattccacccaagcag 
tggtatcaacgcatagtggaaaagcatgacccaggagtccttccaggagagctctgtgaaggagctgagg 
cggctggaggaccagctggccggcctgcagcaggagctggcggctctggcactgaagcagagctcggtgg 
cggaagaagtgggcctgctgccccagcagatccaggccgtgcgggacgacgtggaatctcagttcccggc 
ctggatcagtcagttccttgcccgaggtggagggggccgcgtggggctccttcagagagaggagatgcaa 
gctcagctgcgagagctggagagcaagatcctcacccatgtggcagagatgcagggcaagtcggccaggg 
aagccgcggcctccctgagcctgacgctncanaaagaaggtgtgattggagtgacagaggagcaggtgca 
ccacatcgtgaagcaggccctgcagcgctacagtgaggaccgcatcgggctggcagactacgccctggag 
tcaggaggggccagcgtcatcagcacccgatgttctgagacctacnagaccaagacggncctnctcagcc 
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TCTTNGGNATCCCCCTGGGGTACCACTCCCAGTCACCCCNAGTCATNCTCCANATGNGCACCC^GGCNAC 
TGNTGGGCCTTNCAGGGGCCANNGGGNTTNNCCGGGGNCCGCNTTTTTCCCNA 

Human UNC84B pray sequence - var2 (SEQIDNO:213) 

TGCCGCC&TGGTAGTACCCATACGACGTACCAGTATTACGCTCATATGGCTCATGGCAGGCCAGTGAATT 

' CCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGOT 

^^TCAOTTTCAGGCCCCTACGATGGCCACGTACCAGGTGGTGGA 

CCCCGAGTACACCTGCATCTACCGCTTCAGAGTGCATGGGGAGCCCGCCCACTAGCCCTGCTTACTGGTG 
CCCGCTGCCAGCCATCTGGGAGTGGGTGAACAGCACCCCGCCGCTTCCCCCACACGCTTGCTCGGCGCTC 
TGACTTCTAGGAGC^CAAGAGAGGAGCCTGTGGCCCCATGC^GATGTU^AAGGACGGGCAGGGTCTCCTGA 

GCANCAGGTGGCTCGAGGCGGTTAGCANGCTCG^CM 

TCTGCTTCCCATCCTGGGAGTGTGTNTATATNTANCATATCATGGGGGACTGG 

Unigene Name : VCY2 1 PI Unigene ID : Hs . 66048 Clone ID : GD_181 
Human VCY2IP1 mRNA sequence - varl (public gi: 22002952) (SEQ ID NO: 214) 

AAGATGGCGGCGGTGGCTGGATCTGGGGCTGCCGCGGCTCCGAGCTCACTGCTCCTCGTGGTGGGCAGCG 
AGTTCGGGAGCCCGGGGCTCCTCACCTACGTCCTGGAGGAGCTCGAAAGAGGCATCCGGTCTTGGGATGT 
CGATCCTGGCGTCTGCAACCTTGATGAACAGCTCAAGGTCTTTGTGTCCCGACACTCTGCCACCTTCTCC 
AGCATTGTGAAAGGCCAGCGGAGCCTGCACCACCGTGGAGACAACCTGGAGACCCTGGTCCTCCTGAACC 
CATCAGACAAGTCCCTGTATGATGAGCTCCGGAACCTTCTGTTGGACCCTGCCTCTCACAAGCTACTGGT 
GTTGGCTGGGCTCTGCCTGGAGGAGACGGGGGAGCTGCTGCTACAGACAGGGGGCTTCTCGCCTCACCAC 
TTCCTCCAGGTCCTGAAGGACAGAGAGATCCGGGACATCCTGGCCACCACGCCCCCACCTGTGCAGCCGC 
CCATACTCACCATCACCTGCCCCACCTTCGGTGACTGGGCTCAGCCGGCACCCGCT 

GGGGGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCCGGCGCAGCTGCCCAACTCTGAGGGCCTGTGCGAA 
TTCCTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCTCCG 

GGGGCTTCCTCAGGCTGGGCCGGCCCTGCTGCTACATCCT 
CGCCGTCAATGGCTT(^CTGTGCTGGTCAACGGTGGCT 

CGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGACAGCCTCCCCGGCCTCAACA 
GCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCT 
GCGCAGGCTCATCTCCCCCAACCTGGGGGTC^ 

CGCGGCGAGGATGAGGCGGAGCn\3GCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTC 

GCCGCGGCCCCGTTCCAGCCAAACCCACCGTGCTCTTC6AGAAGAT 

TGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCAC^ 

CCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTGCCTCCTGG 
ACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGG 
GCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCC 
ACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCCCAC 
GCAAGACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAGGAAAGACCCCAAACCGAGTGTCTCCCGGAC 
CCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGCA 
CCCAAGCCCCGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCGCAGCC 
CGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTGGT 
GGCC^CGCCCAGCCTGGAGCTGGGGCCGATCCCA^^ 

GCC^GCTCAATCCCAAGGCCACGCACACCCTCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCGAGC 
GGCTTGTCGCTGAGCCGACTGCGGGGCGGGGAGGCCGGGCGAGAC^ 

GGTGACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTGTCG 
GTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCA^ 

GCCCCTOGGCGCGGCGCTCGGCTTCCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTG 

G(^TCGCAAGGCGGTGCCAATGGCACCGGCACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTG 

TCACAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCT 

CCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGGCTT 

TCGAGTCCCTCGCC^CGACCCTTTGCCTGACCCCCTCAAGGTCCCCCCA 

ATCTGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACAG^ 

GGAAGCCCCTGGCCCGCCCC^CTCACGCPCTGCCGCCCCCAAAGCCACTCCAGTCG 

CAAGGGGCTTGCTGGTGGGGACCGTGCCAGCCGACCACrCAGTGCCCGGAGTGAGCCCAGT^ 

GGCCGGGCACCCCTGTCCAGAAAGTCCTCAACCC^ 

GCAGCCGGCCCGGGGTGTCAGCC^CCCCACCCAAGTCCCCGGTCTACCTGGACCTGGCCTACCTGCCCAG 

CGGGAGCAGCGCC<^CCTGGTGGATGAGGAGTTCTTCC^^^ 
GGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCT^ 

GGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCXxACTCGGTGGCCATGCATACGTGGTACGCAGA 
GACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTGTTGGGCAGCAACAG 

GACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGCCCG 
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CCTGTCCCTAGATTCAGCCACATCAGAAATAAACTGTGACTAC 

Human VCY2EP1 niRNA sequence - var2 (public gi: 21739762) (SEQ ID NO: 215) 

CCGAAGATGGCGGCGGTGGCTGGATCTGGGGCTGCCGCGGCTCCGAGCTCACTGCTCCTCGTGGTGGGCA 

GCGAGTTCGGGAGCCCGGGGCTCCTCACCTACGTCCTGGAGGAGCTCGAAAGAGGCATCCX3GTCTTGGGA 

TGTGGATCCTGGCGTCTGCT^CCTTGATGAACAGCTCAAGGTCTTTGTGTCCCGACACTCT 

TCCAGCATTGTGAAAGGCCAGCGGAGCCTGGACC^CCGTGGAGACAACCTGGAGACCCTGGTCCTCCTGA 

ACCCATCAGACAAGTCCCTGTATGATGAGCTCCGGAACCTTCTGTTGGACCCTGCCTCTCACAAGCTACT 

GGTGTTGGCTGGGCCCTGCCTGGAGGAGACGGGGGAGCTGCTG^ 

(^CTTCCTCCAGGTCCTGAAGGACAGAGAGATCCGGGACATCCTGGCC^CCACGCCCCCACCTGTGCAGC 

CGCCCATACTCACCATCACCTGCCCCACCTTCGGTGACTGGGCTCAGCTGGCACCCGCTGTGC 

TCAGGGGGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCCGGCGCAGCTGCCCAACTCTGAGGGCCTGTGC 

GAATTCCTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCT 

CCGGGGG CTTCCTC^GGCTGGGCCGGCCCrGCTGCTACATCTTCCCTGGAGGCCTCGGGGATGCCGCCTT 

CTTCGCCGTCAATGGCTTCACTGTGCTGGTCAACGGTGGCTCAAACCCCAAGTCCAGTTTCTGGAAGCTG 

GTGCGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGAGAGCCTCCCTGGCCTCA 

ACAGCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAG 

GCTGCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCGGCTG 

GCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCAC 

TCAGCCGCGGCCCCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACAT 

GTATGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGG 

CACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTACCTCC 

TGGACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGA 

GGGGCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTG 

GCCACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCC 

CACGCAAGACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCAAACCGAGTGTCTCCCG 

GACCCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCG 

feCACCCAAGCCCCGC^^GCGCCCAGCACGTCCC^CTCT^ 

GCCCGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCT 
GGTGGCCACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTG 
GCCGCCAGCTC^TCCC^GGCCACGCACACCCTCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCG 
AGCGGCTGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCACCCACAGTGACCACACC 
CACGGTGACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTG 
TCGGTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGC 
GTGGCCCCCGGGCGCGGCGCTCGGCTTCCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATT 
TGAGCATCGCAAGGCGGTGCCAATGGCACCGGCACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTGCC 
CGGTCACAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTGC 
CCACCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGG 
CTTTGGAGTCCCTCGCCACGACCCTTTGCCTGACCCCCTCAAGGTCCCCCCACCACTGCCTGACCCATCC 
AGCATCTGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACAGCACGGCAT^ACGGAGAACGTCAGCCGCA 
CCCGGAAGCCCCTGGCCCGCCCCAACTCACGCGCTGCCGCCCCCAAAGCCACTCCAGTGdCTGCTGCCAA 

AACCAAGGGGCTTGCTGGTGGGGACCGTGCCAGCCX3ACCA 

GGAGGCCGGGCACCCCTGTCCAGAAAGTCCTCAACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAG 

CCAGCAGCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGTCTACCTGGACCTGGCCTACCTGCC 

CAGCGGGAGCAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGCGCTCTGCTACGTCATC 

AGTGGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGC 

ATTGGGACCGTGACCnXSCAGGTGACCCTGATCCCC^CTTTCGACTCGGTGGCCATGCATACGTGGTACGC 

AGAGACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTGTTGGGCAGCAACAGCATGGTGTCCATGCAG 

GATGACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGC 

CCGCCTGTCCCTAGATTCAGCCAGATCAGAAATAAACT 

AA 

Human VCY2IP1 mRNA sequence - var3 (pubHcgi: 21104445) (SEQ ID NO: 216) 

CCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCTC 

CGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCGGCTGGCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTG 
AGCCTCCTGGCGGAGCTGGGCATCACGC^ 

TGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTATGTGCTGGACCCGCCCTCCGCCGGCGCCGA 

GTCCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGGCACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGC 

GTGCTGTTCCCCGGTTGC^CCCCGCCCGCCTGCCTCCTGGACGGCCTGGTCCGCCTGCAGCACTO 

TCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGGGCCGGGGCGAGCCGAGAGCAAAGAGAGCGT 

GGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCCACCCACCCTAGACCTGGCCAGGAGCGCCCT 

GGGGTGGCCCG(^^GGAGCCAGCACGGGCTGAGGCCCCACGCAAGACTGAGAAAGAAGCCAAGGCCCCCC 
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GGGAGTTGAAGAAAGACCCCAAACCGAGTGTCTCCCGGACCCAGCCGCGGGAGGTGCGCCGGGCAGCCTC 
TTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCTC 

CACTCTGGCTTCCCGCCGGTGGCTIAATGGACCCCGCAGCCCGCCCAGCCTCC^ 

CCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTGGTGGCCACGCCCAGCCTGGAGCTGGGGCCGAT 
CCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCCGC 

TCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCGAGCGGCTGTCGCTGAGCCCACTGCGGGGCGGGG 
AGGCCGGGCCAGACGCCTCACCCACAGTGACCACACCCACGGTGACCACGCCCTCACTACCCGCAGAGGT 
GGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTGTCGGTGTCCTTTGAGCAGGTGCTGCCGCCATCC 
GCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTGGCCCCCGGGCGCGGCGCTCGGCTTCCCCAC 
ACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGAGCATCGCAAGGCGGTGCCAATGGCACCGGC 
ACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTGCCCGGTCACAGGAACGGGCAGGTGGGCTGGGGGCC 
GAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTGCCCACCCTGTCTGACTCGGATCCCGTGCCCCTGG 
CCCCCGGTGCGGC^GACTCAGACGAAGACACAGAGGGC^^ 

CCCCCT(^GGTCCCCCC^CCACrGCCTCACCCATCC^GCATCTGCATGGTGGACCCCGAGATGCTGCC^ 
CCCAAGACAGCACGGCAAACGGAGAACGTCAGCCG(^CCCGGAAGCCCOTGGCCCGCCCCAACTCACGCG 
CTGCCGCCCCCAAAGCCACTCCAGTGGCTGCTGCCAAAACCAAGGGGCTTGCTGGTGGGGACCGTGCCAG 
CCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGGAGGCCGGGCACCCCTGTCCAGAAAGTCCTCA 
ACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAGCCAGCAGCCGGCCCGGGGTGTCAGCCACCCCAC 
CCAAGTCCCCGGTCTACCTGGACCTGGCCTACCTGCCCAGCGGGAGCAGCGCCCACCTGGTGGATGAGGA 
GTTCTTCCAGCGCGTGCGCGCGCTCTGCTACGTCATCAGTGGCCAGGACCAGCGCAAGGAGGAAGGCATG 
CGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGCATTGGGACCGTGACCTGCAGGTGACCCTGATCC 
CCACTTTCGACTCGGTGGCCATGCATACGTGGTACGCAGAGACGCACGCCCGGCACCAGGCGCTGGGCAT 
CACGGTGTTGGGCAGCAACAGCATGGTGTCCATGCAGGATGACGCCTTCCCGGCCTGCAAGGTGGAGTTC 
TAGCCCCATCGCCGACACGCCCCCC^CTCAGCCC^GCCCGCCTGTCCCTAGATTCAGCCAC^TCAGAAAT 

AAACTGTGACTTCCAAAAAAAAAAA 

Human VCY2IP1 niRNA sequence- var4 (public gi: 14250679) (seq id no: 217) 

GGGACGAGGCCGCCTTCTTCGCCGTCAATGGCTTCACT 

CAGTTTCTGGAAGCTGGTGCGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGAC 

AGCCTCCCCGGCCTCAACAGCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGG 

GCTCCTGGGACGACAGGCTGCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCTTCAACGCCTGCGA 

GGCCGCGTCGCGGCTGGCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGC 

ATCA03CCTCTGCCACTCAGCCGCGGCCCCGTGC 

TGGGCCGGCTGGACATGTATGTGCTGCACCCGCCCTCC^ 

CGCCCTGCTGGTGTGGCACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACC 

CCGCCCGCCTGCCTCCTGGACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGA 

CGCCCC!yiGGACCTGGAGGGGCCX3GGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAA 

GAGAGAGGGCCTCCTGGCCACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCA 

GCACGGGCTGAGGCCCCATOC^GACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCA 

AACCGAGTGTCTCCCGGACCCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAA 

GACGAATGCCCAGGCGGCACCCAAGCCCCGCAAAGCGCCCAGCATOTCCCACTCTGGCTTCCCGCCGGTG 

GCAAATGGACCCCGCAGCCCGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGGAGC 

CTCCGGCCTCCCAGCTGGTGGCCACGCCCAGCCTGGAGCT 

ACTGGAGCTGCCTTTGGCCGCCAGCTCAATCCCAAGGCCACGCACACCCTCCCCTGAGTCCCACCGGAGC 

CCCGCAGAGGGCAGCGAGCGGCTGTCGCTCAGCCCACT^ 

CCACAGTGACC^CACCC^CGGTGACCAaSC^ 

GGTGGACGAGTCCCTGTCGGTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGA 

CTGAGCCTCCCGCTGCGTGGCCCCCGGGCGOSGCGCTCGGCTTCCCCAC^CGATGTO 

TGTCACCCTGTGAATTTGAG(^TCGCAAGGCX5GTGCCAATGGCACCGGCACCTGCG 

GAATGACAGCAGTGCCCGGTCACAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCG 

GTCT^GCGAGTCCCTGCCCACCCTGTCTGACTCGGAT^ 

ACGAAGACACAGAGGGCTTTGGAGTCCCTCGCCACGACC^ 

ACTGCCIXSACCCATCCAGCATCrc^ 

GAGAACGTCAGCCGCACCCGGAAGCCCCTGGCCCGCCCCJ^^ 
CAGTGGCTGCTGCGAAAACCAAGGGGCTTGCTC 

TGAGCCCAGTGAGAAGGGAGGCCX^GCACCCCTGTCCAGAAAGTCCTCAACCCCCAAGACT 
GGCCCGTCGGGGTCAGCCAGCAGCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGTCTACCTGG 

ACCTGGCCTACCTGCCCAGCGGGAGCAGCGCCCACCTGGT^ 

GCTCTGCTACGTGATC^GTGGCCAGGACCAGCGC^ 

CTGGCCAGCAAGCAGCATTGGGACCGTGACCTGCAGGTC 

TGC^TACGTGGTACGCAGAGACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTO 

CATGGTGTCCATGCAGGATGACXSCCTTCCCGGCTTGC^ 

CCCCACTCAGCCCAGCCCGCCTGTCCCTAGATTCAGCCACA 
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AAAAAAAAAAAAAAAAAAAAAAAA 

Human VCY2BP 1 mRNA sequence - var5 (pubi ic 9 i : 13938254) (SEQ ID NO: 218) 

GACACCGACAGGGACTCGTCCACCTCGGTGTCC^ 
^AGGCTGGGCTGAGCCTCCCGCTGCGTGGCCCCCGGGCGCGGCGCTCGGCTTCCCCACACGATGTGGACCT 
GTGCCTGGTGTCACCCTGTGAATTTG7VGCATCGCAAGGCGGTGCCAATGGCACCGGCACCT 

^^-G<jCAGCTCGAATGACAGCAGT^ 

CCACATCGGTCAGCGAGTCCCTGCCCACCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGC 
AGACTCAGACGAAGACACAGAGGGCTTTGGAGTCCCTCGCCACGACCCTTTGCCTGACCCCCTCAAGGTC 
CCCCCACCACTGCCTGACCCATCCAGC^TCTGC^ 

GGCAAACGGAGAACGTCAGCCGCACCCGGAAGCCCCTGGCCCGCCCCAACTCACGC^ 

AGCCACTCCAGTGGCTGCTGCCAAAACGAAG 

GCCCGGAGTGAGCCCAGTGAGAAGGGAGGCCGGGCACCCCTGTCC^GAAAGTCCTCAACCCCCAAGACTG 
CCACTCGAGGCCCGTCGGGGTCAGCCAGCAGCCGGCCCGGG^^ 

CTACCTGGACCTGGCCTACCTGCCCAGCGGGAGCAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGC 
GTGCGCGCGCTCTGCTACGTCATCAGTGGCCAGGACCAGCGCAAGGAGGAAGGCATGCX3GGCCGTCCTGG 
ACGCGCTACTGGCCAGCAAGCAGCATTGGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTC 
GGTGGCCATGCATACGTGGTACGCAGAGACGGACGCCCGGCACCAGGCGCTGGGCATCACGGTGTTGGGC 
AGCAACAGCATGGTGTCCATGCAGGATGACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCC 
GACACGCCCCC(^CTC^GCCCAGCCCGCCTGTCCCTAGATTCAGCC^CATCAGAAATAAACTGTGACTA^ 
ACTTAAAAAAAAAAAAAAAAAAAAAA 



Human VCY2IP1 raRN A sequence- var6 (public gi: 14042428) (seq id no: 219) 

AAGATGGa^CGGTGGCTGGATCTGGGGCTGCCGC^ 

AGTTCGGGAGCCCGGGGCTCCTCACCTACGTCCTGGAGGAGCTCGAAAGAGGCATCCGGTCTTGGGATGT 
CGATCCTGGCGTCTGCAAGCTTGATGAAGAGCTC^ 

AGCATTGTGAAAGGCCAGCGGAGCCTGCACCACCGTGGAGACAACCTGGAGACCCTGGTCCTCCTGAACC 
CATCAGACAAGTCCCTGTATGATGAGCTCCGGAACCTTCT^ . 
GTTGGCTGGGCTCTGCCTGGAGGAGACGGGGGAGCTGCTGCTAC^ 

TTCCTCCAGGTCCTGAAGGACAGAGAGATCCGGGACATCCTGGCCACCACGCCCCCACCT 
CCATACTCACGftTCACCTGCCCCACCTTCGGTGACT^ 

GGGGGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCCGGCGCAGCTGCCCT^ACTCTGAGGGCCTGTGCGAA 
TTCCTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCTCCG 
GGGGCnTCCTCAGGCTGGGCCGGCCCTGCTGCTA 

CGCCGTCAATGGCTTCACTGTGCTGGTCAACGGTGGCTC^AACCCCAAGTCCAGTTTCTGGAAGCTGGTG 
CGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGACAGCCTCCCCGGCCTCAACA 
GCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCT 
GCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCT^ 

CGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCACTCA 

GCCGCGGCCCCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTA 

TGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGGCAC 

CCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTGCCTCCTGG 

ACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGG 

GCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCC 

ACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCC^^ 

GCAAGACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAGGAAAGACCC 

CCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGCA 
CCCAAGCCCCGC^UVAGCGCCCAGCACGTCCCACTCTC^ 

CGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTGGT 

GGCCACGCCCyVGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACT 

GCCAGCTC2^TCCCAAGGCCACG(^CACCCTCrc 

GGCTGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCACCCACAGTGACC^CACCCAC 
GGTGACCACGCCCTCACTACCCGC^GAGGTGGGCTCCCCG(^CTCGACCGAGGTGGACGAGTCCCTGTCG 
GTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCOTCCCGCTGCGTG 
GCCCCCGGGCGCGGCGCTCGGCTTCCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGA 
GCATCGCAAGGCGGTGCCAATGGCACCGGCACC^ 

TCACAGGAACGGGC^GGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTC 

CCCTGTOTGACTCGGATCCroTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGGCTT 

TGGAGTCCCTCGCCAO^CCCTTTGCCTGACCCCCT 

ATCTGCATGGTGGACCCC^AGATGCTGCCCCCCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACC 
03GAAGCCCCTGGCCCGCCCCAACT 

CCAAGGGGCTTGCTGGTGGGGACCGTGCCAGCCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGG 
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AGGCCGGG^CCCCTGTCCAGAAAGTCCTCAACCCCCAAGACTGCCACT03AGGCCCGTCGGGGTC^GCC 
AGCAGCCGGCCCGGGGTGTCAGOIIACCCC^CCCAAGTCC^ 

GCGGGAGCAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGCGCTCTGCT 

TGGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACXGGCCAGCAAGCAGCAT 

TGGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGC^TAC^TGGTACGCAG 

AGACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTSTTGGGCAGCAACAGCATGGTGTCCATGCAGGA 

TGACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGCCC 

GCCTGTCCCTAGATTCAGCCACATCAGAAATAAACTGTGACTACACTTG 

Human VCY2IP1 itlRNA sequence- var7 (public gi: 13623504) (SEQ ID NO: 220) 

GGCACGAGGCCCTGTATGATGAGCTCCGGAACCTTCTGTTGGACCCTGCCT 

GGCTGGGCCCTGCCTGGAGGAGACGGGGGAGCTGCTGCTACAGACAGGGGGCTTCTCGCCTCACCA 
CTCCAGGTCCTGAAGGACAGAGAGATCCGGGACATCCTGGCC^ 

TACTCACCATCACCTGCCCCACCTTCGGTGACTGGGCTCAGCTGGCACCCGCTGTGCCTGGCC 
GGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCCGGCGCAGCTGCCCAACT 

CTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCTCCGGGG 
GCTTCCTCAGGCTGGGCCGGCCCTGCTGCTACATCTTCCCTGGAGGCCTCGGGGATGCCGCCTTCTTCGC 
CGTCAATGGCTTCACTGTGCTGGTCAACGGTGGCTCAAACCCCA 

CACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGACAGCCTCCCCGGCCTCAACAGCC 
TGCTGCGGCGC7VAACTGGCXK3AGCGCTCCGAGGTGGCTGCTGGTC 

CAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCX3GCTGGCGCGC 
GGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCACTCAGCC 
GCGGCCCCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTATGT 
GCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTGTGTGTGCGCCCTGCTGGTGTGGCACCCC 
GCCGGCCCCGGCGAGAAGGTGGTGTOCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTGCCTCCTGGACG 
GCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGGGCC 
GGGGCGAGCCGAGAGCAT^GAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCCACC 
*CACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCOGCAAGGAGCCAGCA 

AGACT7GAGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCAAACCGAGTGTCTCCCGGACCCA 
GCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGCACCC 
AAGCCCCGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCGCAGCCCGC 
CCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCGGCCTGCGGCTOTCCGGCCTCCC^GCTGGTGGC 
CACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCCGCC 

AGCTCAATCCCAAGGCCACGCACACCCTCCCCTGAGTCCC^ 

TGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCACCCACAGTGACCACACCCACGGT^ 

GACCACGCCOTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACX3AGTCCCT 

TCCTTTGAGCAGGTCCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTGGCC 

CCCGGGaSCGGCGCTC^GCTTCCCCACACGATC^ 

TCGCAAGGCGGTGCCAATGGCACCGGCACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTGCCCGGTCA 
CAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTGCCCACCC 
TGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGGCTTTGG 
AGTCCOTCGCCACGACCCTTTGCCTGACCCCCTCAAGGTCCCCCCACCACTGCCTGACCCATCCAGCATC 
TGCATGGTGGACCCCGAGATGCTGCCCCQCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACCCGGA 
AGCCCCTGGCCCGCCCCAACTCACGCGCTGCCGCCCCCAAAGCCACTCCAGTGGCTGCTGCCAAAACCAA 
GGGGCTTGCTGGTGGGGACCGTGCCAGCCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGGAGGC 

CXXSGCACCCCTGTCCAGAAAGTCCTCAACCCCCAAGACT^ 
GCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGTCrACCTGC^ 

GAGCAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGCGCTCTGCTACGTCATCAGTGGC 

CAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGCATTGGG 

ACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGCATACGTGGTACGCAGAGAC 

GGACGCCCGGCACCA(3GCGCTGGGCATCACGGTGTTGGGCAGCAACAGCATGGTGTCCATGCAG^ 

GCCTTCCCGGCCTGC^GGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGCCCGCCT 

GTCCCTAGATTCAGCCACATCAGAAATAAACTGTGACTACACTTGAAAAAAAAAAAAAAAAAA 

Human VCY2IP1 mRNA sequence - var8 (public gi: 10434893) (SEQ IDNO: 221) 

GAACCCCAAGTCCAGTTTCTGGAAGCTGGTGC 

CCTGGCGCCGACAGCCTGCCCX3GCOTCAACAGCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGG 

CTGCTGGTGGGGGCTCCTGGGACGACAGGCTGCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCTT 

CAACGCCTGCX3AGGCCGCGTCGCGGCTGGCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTG 

GCXjCAGCTGGGCATCACGCCTCTGCCACTCAGCCGCGGC^ 

AGAAGATGGGCGTGGGCCGGCTGGACATGTATGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCT 
GGCCTCTGTGTGCGCCCTGCTGGTGTGGCACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTC 
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GGACAGCTCGAAGAGAGAGGGCC 
CGCAAGGAGCCAGCACGGGCTGfl 
AGAAAGACCCCAAACCGAGTGTC 
CAACCTCAAGAAGACGAATGCCC 
TTCCCGCCGGTGGCAAATGGACCCCX3CAGCCCGw|-— 
CAGCCTGCGGCTCTCCGGCCTCCC^GCTGGTGGCCACGCCCAGCCTCGAGU-r^^ 

GGAGGAGA^GGCACTGGAGCTGCCTTTCGCCGCCAGCT^ 



goacagctcgaagagagagggcc^ 
cgcaaggagccagcacgggc 
agaaagaccccaaaccgagt 

CAACCTCAAGAAGACGAATG^uw,,. ^^^^TOTCGWSAROCCftGCCCCCCCAGXG 



CGCAAGGAGCCAGCACGGGCTGAGGCCCCACGCAAGACTGAGAAAGAAGCCAAG 

™ccccaa*ccga = ^^^ 



CGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGC 



TCCCACCGGAGCCCCGCAGAGGGCAGCGAGCGGCTGT' - rvwnar araTGCMCTCCCC 

GCACGGCAAACGGAGAACGTCAGCC 

CCAAAGCCACTCCAGTGGCTGCTGCw*«™^™^ - - CCCAA< 



CGGTCTACCTGGACCTGGCCTACCTGCCCAGCGGGAGCAGCGCCCACCTGGTGGATGAGGA 
CTGGAGGCGCTACT^CCAGCAAGC^GCATTGGGACCGTGA^ 



CTAC 

u,,™*™ VPY9TP1 mRNA seauence - var9 (public gi= 7022843) (seq id no: 222) 

rCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTATGTGCTGCA 

tcS?ctoc^cacSgaggttcctgcgagagcccgtggtgacgccccaggacctgga 

======= 

^ccc^ggccacgcacaccctcccctgagtcccaccggagccccgcagagggcagcgagcggct 

GGCGGTGCCAATGGCACCGG^CCTGCGTCCCCCGGCAGCTCGAATGACAG^ 

^ScGCCCCAACTCACGCGCTGCCGCCCCCAAAGCCACTCCAGT^ 
TG^TGG^TOGGGACCGTGGCAGCCGACCACTCAGTGCCCGG 

^^^TCCAGAAAGTCCTCAACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAGCCAGCAGCCGGC 

ccgSSg^ccc^caagtccccggtctacctg^^ 
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AG ATT CAG CCACATCAGAAATAAACTGTGACTAC ACTTG 



HiunauVCY2IPl Protein sequence - varl (public gi: 22002953) (seq id no: 315) 

vJrVKGGRSLHHROTNLETL^^ 

^V^iOSIRDIIATTPPPVQ^^ 
JXElTVAESIiEPPSPFELLEPPTSGGFIiRLGRPCCT 
— 1HLLDRVD AVLVTHPGAD S Ii PGLNS LLRRKLAERSE VAAGGGS WDDRLRRLI S PNLGWFFNACE AAS RliAR 
*' GEDEAEIiALSLiIiAQIiGI XPLPLSRGPVPAKPTVLFEKMGVGRI»DMYVLHPP SAGAERTLASVCALLVWHP 
AGPGEKVVRVIjFPGCTPPACLLDGLVRIjQHLRFLREPVVTPQDXjEGPGRAES KE S VGSRDSS KREGLIiAT 
HPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELRKDPKPSVSRTQ 
KPRKAPSTSHSGFPPVANGPRSPPSI^CGEASPPSAATO^ 

SSIPRPRTPSPESHRSPAEGSERLSLSPLRGGEAGPDASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSV 
SFEQVLPPSAPTSEAGLSLPLRGPRARRSASPHDVDLCLVS^ 

QERAGGLGAEETPPTSVSESLPTLSDSDPVPIAPGAADSDEDTEGFGVPRHDPLPDPIiKVPPPLPDPSSI 
CWDPEMLPPKTARQTENVSRTRKPIJUIPNSRAAAPKATPVAAAKTKGIjAGGDRASRPLSARSE 

RAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPVYLDI^^ 
QDQRKEEGMRAVIJDALLASKQHWDRDLQVTLIPTFDSVAMHTWYAETH^ 

AFPACKVEF 

Hmnan VCY2EP1 Protein sequence- var2 (public gi: 21739763) (seq id no: 316) 

PKMAAVAGSGAAAAPSSLLLWGSEFGSPGLLTYVLEEl^^ 
S S I VKGQRSLHHRGDNLETLVLLNP SDKSLYDEIi^^ 

HFLQVLKDRE I RDI LATTPPPVQP P I LTI TCPTFGDWAQIiAPAVPGLQGAIjRIjQLRLNPPAQIiPNS EGLC 
EFLEYVAESLEPPSPFELLEPPTSGGFLRIjGRPCCYI FPGGIiGDAAFFAVNGFTVLVNGGSNPKSSFWKL 
VRHLDRVDAVLiVTHPGADS LPGIjNSIjIjRRKIiAERS EVAAGGGS WDDRLRRL I S PNLGWFFNACEAASRI* 

argedeaelalsliaq^itplplsrgpvpakptvlfek^^ 

HPAGPGEKVVRVIiFPGCTPPAYL^GLVRLQHIJlFLREPVVTPQDLEGPGRAESKE 
ATHPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELKKDPKPSV 

APKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAACGSPASQLVATPSLELGPIPAGEEKAIiEIiPL 
AASSIPRPRTPSPESHRSPAEGSERLSLSPIiRGGEAGPDASPTVTTPTVTTPSLPAEVGSPHSTEVDESL 
SVSFEQVLPPSAPTSEAGLSLPIjRGPRARRSASPHDVDLCLVSPCEFEHRKAVPMAPAPASPGSSNDSSA 
RSQERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAADSDEDTEGFGVPRHDPLPDPLKVPPPIiPDPS 

SICMVDPEMLPPKTARQTENVSRTRKPLARP 

GGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPVYLDLAYLPSGSSAHLVDEEFFQRVRALCYVI 
SGQDQRKEEGMRAVLDALLAS KQHWDRDLQVTL I PTFDSVAMHTWYAETHARHQALGITVIjGSNSMVSMQ 
DDAFPACKVEF 

Hiunan VCY2DP1 Protein sequence- var3 (public gi: 21104446) (seq id. no: 317) 

MGVGRLDMYVLHPPSAGAERTLASVC^IiVW^ 

VVTPQDLEGPGRAESKESVGSRDSSKREGLIATHPRPGQERPGVARKEPARAEAPRKTEKEAKAPR^ 

DPKPSVSRTQPREVRRAASSVPNLKKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPS^ 

CGSPASQLVATPSLELGPIPAGEEKALELPIjAASSIPRPRTPSPESHRSPAEGSERIjSIjSPIiRGGEAGPD 

ASPTVTTPTVTTPSIiPAEVGSPHSTEVDESLSVSFEQVIiPPSAPTSEAGLSLPLRGPRARRSASPHDVDIj 

CLVSPCEFEHRKAVPMAPAPASPGSSNDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAA 

DSDEDTEGFGVPRHDPLPDPIjKVPPPLPDPSSICMVDPEMLPPKTARQTEN^ 

ATPVAAAKTKGLAGGDRASRPLSARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPV 

YLDLAYLPSGSSAHIiVDEEFFQRVRALCYVISGQDQRKEEGMRAVLDALIASKQHTO 

VAMHTWYAETHARHQAI^ITVIiGSNSMVSMQDDAFPACKVEF 

Human VCY2IP1 Protein sequence - var4 (public gi: 142506B0) (seq id no: 318) 

MGVGRLDMYVLHPPSAGAERTIiASVCALLWHPAGPGEKVVRVLFPGCTPPA 

VVTPQDIiEGPGRAESKESVGSRDSSKREGLLATHPRPGQERPGVARKEPARAEAPR 

DPKPSVSRTQPREVRRAASSVPNLK3CTNAQAAPKPRKAPSTSH^^ 

CGSPASQLVATPSLEIiGPIPAGEEKALELPIiAASSIPRPRTPSPESHRSPAEGSERLSLSPLRGGEAGPD 

ASPTVTTPTVTOPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEA 
CTiVSPCEFEHRKAVPMAPAPASPGSSNDSSARSQERAGGLGAEETPPTSVSES 

DSDEDTEGFGVPRHDPLPDPIJKVPPPLPDPSSIC^IVDPE^^PPKTARQTENVSRTRXPI^ARPNSRAA 
ATPVAAAKTKGLAGGDRASRPLSARS EPSE KGGRAPLSRKS STPKTATRGPS GS ASS RPGVS ATP PKS PV 
YLDLAYLPSGSSAHLVDEEFFQRVRALCTYVISGQDQRKEEGMRAVIi 
VAMHTWYAETHARHQALGITVLGSNSMVSMQDDAFPACKVEF 
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Human VCY2IP1 Protein sequence - var5 (public gi: i39382ss) (seq id no: 3X9) 

DTO^SSTSVSPEQVLPPSAPTSE^SLPLRGPRARRSASPHDVDLCLVSPCEFEHRKAVPMAPAPASP 
MSNDSSARSQERAGGLGAEETPPTSVSBSLPTIiSDSDPVPIiAPGAADSDBDTEGPGV'P 

ARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPVYLDLAYLPSGSSAHLVDEEFFQR 

^cwisgqSrkeegmravi^ai^ 
snsmvsmqddafpackvef 

Human VCY2IP1 Protein sequence- var6 (public gi: 14042429) (seq id no= 320) 

MAAVAGSGAAAAPSSLIjLVVGSEFGSPGLLTYV 
IVKGQRSLHHRGDNIjETLVLLOTSDKSLTO 

I/WLKDRE IRDI LATTPPPVQPPI LTI TCPTFGDWAQPAPAVPGLQGALRLQLRIjNPPAQLPNS EGLCEF 
MYVAESLEPPSPFELLEPPTSGGFLRIXSRPCCYIFPGGI^DAAFFAWGFTVLWGGSNPKSSFWKL^ 
HLDRVDAVLVTHPGADS LPGLNSLI1RRKI1AERSEVAAGGGSWDDRLRRI1I S PNLGWFFNACEAASRIiAR 
GEDEAELALSLLAQLGITPLPIjSRGPVPAKPTVLFEKMGVG 

AGPGEKVVRVLFPGCTPPACLLDGLVRLQHLRFLRE PWTPQDLEGPGRAES KESVGSRDSS KREGLLAT 

HPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELRKDPKPSVSRTQPREVRRAASSV 

KPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAACGSPASQLVATPSLELGPIPAGEEKALELPIAA 

SSIPRPRTPSPESHRSPAEGSERLSLSPLRGGEAGPDASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSV 

SFEQVLPPSAPTSEAGLSDPLRGPRARRSASPHDVDLCLVSPCEFEHRKAVPMAPAPASPGSSNDSSARS 

QERAGGLGAEETPPTSVSESLPTLSDSDPVPDAPGAADSDEDTEGFGVPRHDPLPDPLKVPPPLPDPSSI 

CMVDPEMLPPQDSTANGERQPHPEAPGPPQLTRCRPQSHSSGCCQNQGACWWGPCQPTTQCPE 

Human VCY2IP1 Protein sequence - var7 (public gi: 13623505) (seq id no: 321) 

MGVGRLDMYVLHPPSAGAERTLiACV CAIjIjVWHPAGPGEKVVRVLFPGCTP PACLLDGIjVRIjQHLRFLRE P 

WTPQDLEGPGRAESKESVGSRDSSKREGLLATHPRPGQERPGVARKEPARAEAPRKTEKEAKTPRE 

DPKPSVSRTQPREVRRAASSVPNLKKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAA 

CGSPASQLVATPSLELGPIPAGEEKALEIiPIiAASSIPRPRTPSPESHRSPAEGSERLSLSPLRGGEAGPD 

ASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEAGIiSLPLRGPRARRSASPHDVDL 

CLVSPCEFEHRKAVPMAPAPASPGSSNDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAA 

DSDEDTEGFGVPRHDPLPDPLKVPPPIjPDPSSICMVDPEMLPPKTARQTENVSRTRKPIjARPNSRAAAPK 

ATPVAAAKTKGLAGGDRASRPLSARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPV 

YLDLAYLPSGSSAHLVDEEFFQRVRAL^ 

VAMHTWYAETHARHQALGITVLGSNS^SMQDDAFPACKVEF 

Human VCY2EP1 Protein sequence - var8 (public gi: 10434894) (seq id no: 322) 

MGVGRIjDI^r^LHPPSAGAERTIjASVCAIjLVWHPAGPGEKVVRVIiF PGCTP PACLIiDGIjVRIjQHLRFLREP 
VVTPQDLEGPGRAESKESVGSRDSSKR£GLIiATHPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELKK 

dpkpsvsrtqprevrraassvpnlkktnaqaapkprkapstshsgfppvangprsppslrcgeasppsa^ 

CGSPASQLVATPSLELGPIPAGEEKALEIiPLAASSIPRPRTPSPESHRSPAEGSERLSLSPIiRGGEAGPD 
ASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEAGLSLPl^GPRARRSASPHDVDL 
CLVSPCEFEHRKAVPMAPAPASPGSSNDSSARSQERAGGLGAEETPPTSVSESLPT^ 
DSDEDTEGFGVPRHDPIiPDPLKTPPPLPDPS S I CMVDPEMI1PPKTARQTENVSRTRKPI1ARPNSRAAAPK 
ATPVAAAKTKGLAGGDRASRPLSARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPV 
YIiDLAYLPSGSSAHLVDEELFQRVRAIjCYVI SGQDQRKEEGMRAVLDALIiASKQHWDRDLQVTIjI PTFDS 
VAMHTWYAETHARHQALG ITVLGSNSMVSMQDDAFPACKVEF 

Human VCY2IP1 Protein sequence - var9 (public gi: 7022844) (seq id no: 323) 

MGVGRLDMYVLHPPSAGAERTLASVCALLVWHPAGPGEKVVRVLFPGCTPPACI^ 

WTPQDLEGPGRAESKESVGSRDSSKREGLLATHPRPGQERPGVARKEPARAEAPRKTEKEAK^PRELKK 

DPKPSVSRTQPREVRRAASSVPNLKKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASP 

CGS PASQLVATPSLELiGP I PAGEEKALELPLAAS S I PRPRTPS PESHRS PAEGS ERLS LS PLRGGEAGPD 
ASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEAGLSLPIiRGPRARRSi^ 

CLVSPCEFEHRKAVPMAPAPASPGSSITOSSARSQERAGGLGAEETP^ 

DSDEDTEGFGVPRHDPIjPDPIjICN^PPIjPDPSSICWVDPEMLPPKTARQTENVSRTRKPI^^ . 
ATPVAAAKTKGIiAGGDRASRPLSARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPV 

YLDIiAYLPSGSSAHLVDEEFFQRVRALCYVISGQDQRKEEGMRAVLDALliA^ 
VAMHTWYAETHARHQALGITVL.GSNGMVSMQDDAFPACKVEF 
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Unigene Name; SPG20 Unigene ID; Hs . 118087 

Human SPG20 mRNA sequence - varl (public gi: 28436884) (SEQ ID NO: 367) 

GAGTCTGAACACAC^GGTACTGGGTGGGAATCTGCTAGACAGA^ 

MGGtcScAGATO^TGTAAATCCTGCAGGGGAGGTTAGTGCACCTTCGTATCCTGGGTA^ 

^^^^^^^^^^^^ 

GCACTGATGTGAAACAGTTGGACCAAGGCAATAAGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGC 
TAAAGA^ACTTCA^GTGAAGAAGTTAA.CCTGAGTCACATTGTACCIATGTGAGCC^ 

ga^cIaccagaagWaacccgtggaagttagtccagctgtcaccaagggactttatat^^ 
SIca^gSga^cagcaaaagtcagtcagttcc 

Saaaagaactagctccacatgtcaagaagcatggaagcaaacttgttccagaatot 

JScAtGGATTGGAATGTGCAGCTAAATGCATCGTTAACA^^ 

gSacaa^tacggatataatgcaggagaagctacccac^ 

SS^ACAATATTAACAACATTGGTATCAAAGCAATGGTGAAGAAAACTGCA^ 

a^ccSg^gmtatcagatagttgataattctcagagggaaaatcaagaaggagcagca^ 
?ctc^acttaaaaggaaaaga^^ 

aIgtSattStttccaagtgtagctaaaatatttttgcaggtaaaataaagct^^ 

^Saacc^Saaacctaatattgaactatttttatatctgct 

tatatatttgcttaaaaattggcatttagtaaccttaattct 

gtcccctcttttt<^^ 

aaaaactaraactaagaattctcaaaaagacttgttcaaaacagggaaagactgatgaa^ 
a^^t™aacttacctgtttgttaggaaatggaatggtttctttgatttaaaatgaataaaa^ 

attaaarttcgaagtgattattatcagcttaattggattaaaaaagtacttcaagaaattaaaaaaaaaa 
aaaaaaaaaaaataaaaaaaaaaaaaaa 

Human SPG20 mRNA sequence - var2 (public gi: 7023530) (SEQ ID NO: 368) 
agggagctctcgaggcaacgccggggcgcccgaggtctggaaggcgcagaaatggagcaagagccacaaa 

ATGGAGAACCTGCTGAAATTAA.GATCATCAGAGAAGCATATAAGAAGGCCTTTTTATTTGTO^CAAAGG 
TCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACT^ 

CTCAGAGGGATCAGCATTTCATCAAAAGAGTCTGAACACACAGGTCCTGGGTGGGAATCTGCTAG^ 

TGCAACAGAAAATGAAAGAAACTCTACAGAATGTACGCACCAGGCTGGAAATTCTA 

CACTTCTCTGCAGAATGATCTTCAGGAGGTGCCCAAGTTATATCCAGAAT 

GAAAAATTACCAGAGCCTCAGTCTTTTAGTTCAGCTCCT 

CTCCAAGTGCAGGGGCAGTTGCTGCACCTGCTTCTCTGTCTTT^ 

TCCTCCTGCTTATACTCCTCAAGCTGCTGAAGGTCACTAC^CTGTATCCTATGGAACAGATTCTGGGGAG 
TTTTCATCAGTTGGAGAGGAGTTTTATAGGAATCATTCTCAGCCACCGCCTCTTGAGACCTTAGGGCTGG 
ATGCAGATGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTTTGTAAATCCTGCAGGGGAGGTTAG 
TGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATACGGTTCTAAAC 
CGTCCTCCCGGGTTTC^CAGGTTTGTGACTGGTTATATCCTCTAGTTCC^ATAGATCTCC^TTCTGA 
AATGTACTGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGCAGCAGGATGCTTTGTGGGGGTCGT 
CCTGTCCTCTGAGTTACCAGAGGATGATAGAGAGCTCTTTGAGGATCTGTTAAGGCAAATGTCTGACCTT 
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■ ATATTGTTTATTAGQGAATACTGAGAAAGATAATATTTTGT^ 

■*£S52£S5SSESS2ES5S5SZ£z*^^ 



.TCAGATATTGTCATATAAGTATAAATTCTTCCTGGCCTA 



GAATCTAAAATTAGTTAXGTTCGTTTCCCCTTTGAAGCTCCCTCT 
GAATTGTGAGAAATAAATTTTTGTTGTTAAT 

Human SPG20 niRNA sequence - var3 (pubKc gi: 7023 938) (SEQ ED NO: 369) 

TAGCTCCTGMAGATCTCCGGTTCTGAAATGTACTGCGGGAGCCTACATGTTTCCT 
GATCTGTTAAGGCA^TGTCTGACCTTCGGCTCCAGGCCAACTGGAACAGAGCAGAAGAAG 

tccaaJJSga^aacSgaccctcctctgaccaactaaa^ 



^^^^^^^^^^^ 

g^gctaaatccItcgttaacaatgtttc^ 
?accaa?gccttatgWtggatgaaattttgttaaata^ 

SaTOTCAAATG^CATTC^ 
GGAAAMAATAAOTMTCTTGCA 

SgSgctaaaat^ 

Saatat?ga^ 

aSS^g^aac^ 

CTA^GTGGATTTTTTTAGATGCTTCTCAAAATTTTCAGTGTGTAAGCTAAACAAAA^ 
S^CTCAAAAAAACTTGTTCAAAACAGGGAAAGACTGATGAAAA 

ACCTGTTTGTTAGGAAATGGAATGGTCTCTTTGATTTAAAATGAATAAAAATAGATTATTACGTC 

Human SPG20 mRNA sequence - var4 (public gi: 16553694) (SEQ ID NO: 370) 
gIgcaIgagSac^ 

TATTTGTTAACARAGGTCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCA?^ 
AG(3AATAGGACAC^TCCTC^ 

gaatctgI^gacagatgcaacagaaaa^^ 

tcgaattgtgaggtttttggataattctctcgatacggttctaaaccgtcotcccggg^totcaggct 
tgt^ctggttatatcctctagttcctgatagatctccggttctgaaatgtactgcgggagc 

rcCCTGATACAATGCTACAAGCAGCAGGATGCTTTGTG^^ 
TGATAGAGAGCTCTTTGAGGATCTGTTAAGGCAAATGTCTGACCITCGG 

GCAGAAGAAGAAAATGAATTCa^AATCCCTGGAAGAACTAGACCCTCCTCTGACCAACTAAAA 
CTGGCACTGATGTGAAACAGTTGGACCAAGGCAATAAGGATGTACGTCATAAAGGAAAACGTGG 

GGCTAAAGATACTTCAAGTGA&GAAGTTAACCTGAGTCACATTGT^ 
AAGCCAAAAGAATTACCTGAATGt^TGAAAAAGTG^ 

GTTGGGGTTTAGTCAAAGGTGCTGAGATTACTGGTAAGGCAATCCAGAAAGGTGOT 
GCGGATTCAACCAGAAGAAAAACCCGTGGAAGTTAGTCCAGCTGTCaCCAAGGGACTTTATATAGCGAAG 
CAAGCTACAGGAGGAGCAGCAAAAGTCAGTCAGTTCC^TTGATGGAGTTTGCACTGTAGCAA^ 
rTGGAAAAGAACTAGCTCCACATGTCAAGAAGCATGGAAGCAAACTTGTTCCAGAATCTCTTAAAAAAGA 
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CAAAGATGGGAAATCTCCTCTGGATGGTGCTATG 

tcagma^^ 
tSctgc^ 

^TCTCCTTGAGG 

^CGTCAGAGGGGAGAAGGATGAGCAGACGA^ 
TCCTCA^^ 

GTTCAAACCTTGTTAAACCTAATATTGAACTATTTTTATATCT 
CTATATATTTGCTTAAAAATTGGCATTTAGTAACCTTAATTCI^ 
TGTCCCCTCTTTTTGCACTAATTGTGGATTTTTTTA6ATGCTTCTCAAAATTTTCA 
CAAAAACTAAAACTAAGAATTCTCAAAAAAACTTGTTCAAAA 

GACTACTTTTGTAACTTACCTGTTTGTTAGGAAATGGAATGGTCTCTTTGATTTAAAATGAATAAZ^AATA 
GATTATTACGTC 

Human SPG20 ruRNA sequence - var5 (public gi: 21654722) (SEQ ID NO: 371) 

ATGGAGCAAGAGCCACAAAATGGAGAACCTGCTGAAA^ 

TTTTATTTGTTAACAAAGGTCTGAATACAGATGAATTAGGTCAGAA 

GCAAGGAATAGGACACCTGCTCAGAGGGATGAGCATTTCATCAAAAGAGT 

TGGGAATCTGCTAGACAGATGCAAC^GAAAATGAAAGAAACTCTACAGAATGTACGCACCAGGCTGGAJ^ 
TTCTAGAGAAGGGTCTTGCCACTTCTCTGCAGAATGATCTT 

TCCACCTAAAGACATGTGTGAAAAATTACCAGAGCCTCAGTCTTTTAGTTCAGCTCCTCAGCATGCTGAA 
GTAAATGGAAACACCTCAACTCCAAGTGCAGGGGCAGTTGCTGCACCTGCTTCTCTGTCTTTACCATCAC 

AAAOTTGTCCAGCAOAM^ 

TGGAACAGATTCTGGGGAGTTTTCAT(^GTTGGAGAGGAGTTTTATAGGAATCATTCTCAGCCACCGCCT 

CTTGAGACCTTAGGGCTGGATGCAGATGAATTGATTTTGATACC7VAATGGAGTACAGATTTTTTTTGTAA 

ATCCTGCAGGGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTC 

TCTCGATACGGTTCTAAACCGTCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGT 

GATAGATCTCCGGTTCTGAAATGTACTGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGCAGCAG 

GATGCTTTGTGGGGGTCGTCCTGTCCTCTGAGTTACCAGAGGATGATAGAGAGCTCTTTGAGGATCTGTT 

AAGGCAAATGTCTGACCTTCGGCTCCAGGCCAACTGGAAGAGAGCAGAA 

CCTGGAAGAACTAGACCCTCCTCTGACCAACTAAAAGAAGCCTCTGGCACTGATGTGAAACAGTTGGACC 
AAGGCAATAAGGATGTACGTC^TAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGT 
TAACCTGAGTCACATTGTACCATGTGAGCGAGTTCCAGAAGAAAAGCCAAAAGAATTACCTGAATGGAGT 
GAAAAAGTGGCTCACAACATTTTGTCAGGTGCTTCCTGGGTGAGTl^GGGTTTAGTCAAAGGTGCTGAGA 
TTACTGGTAAGGCAATCCAGAAAGGTGCTTCTAAACTCCGAGAGCGGATTCAACCAGAAGAAAAACCCGT 
GGAAGTTAGTCCAGCTGTCACCAAGGGACTrTATATAGCGAAGCAAGCTAC^GGAGGAGCAGCAAAAGTC 
AGTCAGTTCCTGGTTGATGGAGTTTGCACTGTAGCAAATTGCGTTGGAA7VAGAACTAGCTCCACATGTCA 
AGAAGCATGGAAGCAAACTTGTTCCAGAATCTCTTAAAAAAGACAAAGATGGGAAATCTCCTCTGGATGG 

TGCTATGGTTGTAGCAGCAAGTAGTGTTCAAGGATTTTCAACTGTCTGGCAA^ 

AAATGCATCGTTAACAATGTTTCAGCAGAAACTGTACAAACTGTCAGATACAAATACGGATATAATC 

GAGAAGCTACCCACCATGCGGTGGATTCTGCGGTCAATGTTGGCGTAACTGCCTACAATATTAACAACAT 

TGGTATCAAAGCAATGGTGAAGAAAACTGCAACACAAACAGGACACACTCTCCTTGAGGACTATCAGATA 

GTTGATAATTCTCAGAGGGAAAATCAAGAAGGAGCAGCAAATGTCAACGTGAGAGGGGAGAAGGATGAGC 

AG ACG AAG GAAGT AAAGG AGGCAAAG AAG AAAGAT AAATG A 

Human SPG20 niRNA sequence - var6 (public gi: 22074831) (SEQ ID NO: 372) 

GCGGCCGCGCAGGGAGCTCTCGAGGCAACGCCGGGGCGCCCGAGGTCTGGAAGGCGCAGAAATGGAGCAA 

GAGCGACAAAATGGAGAACCTGCTGAAATTAAGATCAT CAGAGAAG CATATAAGAAGGCCTTTTTATTTG 
TTAACAAAGGTCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACTATAAGCAAGGAAT 

AGGACACCTGCTCAGAGGGATCAGCATTTCATCAAAAGA 

GCTAGAGAGATGCAACAGAAAATGAAAGAAACTCTACAGAAT^ 

AGGGTCTTGCCACTTCTCTGCAGAATGATCTTCAGGAGGTO 

AGACATGTGTGAAAAATTACCAGAG CCTCAGT CTTTTAGTTCAGCTCCTCAGCATG CTGAAGTAAATGGA 

AACACCTCAACTCCAAGTGCAGGGGCAGTTGCTGCACCTGCTTCTCTGTCTTTACCAT 

CAGCAGAAGCTCOTCCTGCTTATACTCCTCAAGCTGCTGAAGGTCA 

TTCTGGGGAGTTTTCATC^GTTGGAGAGGAGTTTTATAGGAATCATTCTCAGCCACCGCCTCTTGAGACC 
TTAGGGCTGGATGCAGATGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTTTGTAAATCCTGCAG 
GGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATAC 
GGTTCTAAACCGTCCTCCCGGGTTTCTTC^GGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATCT 
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TGGGGGTCGT^ 

GTCTGACCTTCGGOTCCAGGCCAACTGGAAC^ 
ACTAGACCCTCCTCTGACCAACTAAAAGAAGCCTCTGGCACT 

AGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGTTAACCTGAG 

TCACATTGTACCATGTGAGCCAGTTCCAGAAGAAAAGCCT^ 

GCTCACAACATTTTGTCAGGTGCTO 

AGGCAATCCAGAAAGGTGCTT CTAAACTC CGAGAGCGGATTCAACCAGAAGAAAAACCCGTGGAAGTTAG 
TCCAGCTGTCACCAAGGGA^ 

CTGGTOGATGGAGTTTGGACTGTAGCAAATTGCGTTGGAAAAGAACTAGCTCC^ 
GAAGCAAACTTGTTCCAGAATCTCTTAAAAAAGACAAAGATGGGAAA 
TGTAGCAGCAAGTAGTGTTCAAGGATTTTCAACTGTCTGGCAAGGA^ 
GTTAACAATGTTTGAGCAGAAACTGTACAAACTGTC 

CC CACCATGCGGTGGATTCTG CGGT CAATGTTGGCGTAACTGC CTACAATATTAACAACATTGGTATCAA 
AGCAATGGTGAAGAAAACTGCAACACAAACAGGACACACTCTCCTTGAGGACTATC^GATAGTTGATAAT 
TCTCAGAGGGAAAATCAAGAAGGAGCAGCAAATGTCAACGTGAGAGGGGAGAAGGATGAGGAGACGAAGG 
AAGTAAAGGAGGCAAAGAAGAAAGATAAATGATGAAGTGCTGGGAATCACTTATACCAAAGCCTTATGAA 
ATGGATGAAATTTTGTTAAATAGGCAAATGTGGAATTCCTCACAGATTAACCAGTATTTTTTAAATGTAT 
TCATTCCTACAAATTAACTTTCATAAATTTTATGGCATGTCTTCTATTTAAAAGGAAAAGT^ATAAGTATT 
CTTGCATCTGGCCTTAGAAATGTGAAGTTATATTCTCAAGTTTATTTTTTTCCAAGTGTAGCTAAAATAT 
TTTTGCAGGTAAAATAAAGCTGATAGT ACATGTGTTGTT CAAACCTTGTT AAAC CT AAT ATTGAACTATT 
TTTATATCTGCTGTCTTTCAGAAGGCAAATAGGAAACTATATATTTGCTTAAA^ 

CTTAATTCTTTTTATAGAAGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCACTAATTGTGGATTTTTT 
TAGATGCTTCTCAAAATTTTCAGTGTGTAAGCTAAACA 

GTTCAAAACAGGGAAAGACTGATGAAAAGTAAAATGGACTACTTTTGTAACTTACCTGTTTGTTAGGAAA 
TGGAATGGTCTCTTTGATrTAAAATAAATAAAAATAGATTATTACGTCTTTTGTATTGAGACTGTATTGT 
TATGAGCCTAGGAAATTTGGGAAGATGATTGTATTGTATTAAAATTCGAAGTGATTATTATCAGCTTAAT 

TGGATTAAAAAAGTACTTCAAGAAATTATTT^ 

GTAAAAAAAATATATATAAACAATTGAGTTTACTAATGGTAAAGATTTTTATTCTGGGATTCGGTCATTG 

GAATTTATATTAAAAGACAAGTTATTAAAAAGGAAAGGTTCTATTCATAATCAGGGTAAAGAATATGAAA 

ACCTTAGACGTAATCCATGGTGGATAGGCATTATGGTTTCCACTTTGGCAGAAGGCAGA 

CCTATTTACTTACATAGGCTAAAAAACTATGTAACTAAATACCTAATGGTATTTAATTTTTGTTTATTGA 

ATTTAAGAGATTGGTATTAGTTTTCATAG CTGTAGTCCATT CTAATAATTT CTGATCTTCTAGTGGCTAC 

TTAATTAGACATTATTTGAAGCTGTCTGAAGAATGCACTrTATGAATTAAAAAACTGA^ 

CGTTATCACATGAGCTTATATTTTGGGAACACATAGAACTGATGGAGGCTTTTCCTAAGGCCAAGGATAA 

TGTACTAGTTGTTAAAATGGAAATAAAAGTGAAGTGGTAAAT 

Human SPG20 mRNA sequence - var7 (public gi: 20070809) (SEQ ID NO: 373) 

GGCGGCGGCGTGCTGCGGGCTCTGTGGCGGGAGCGAGGCCGACGGGCGGGGCCGTGCGGCCGCGTGACGC 
GAAGCGTTCGAGAGCGCGCGTCGTGGAACGTCTTGGTTGCCACGGCAAGCGCGCGCGCGAGGCCTTGGGA 
ACCTCGGGACCGGCCCCCGGCGAGCGCAGCGGCGCCCAGTAGTCATCTTAGTGGGATTTGGGGAAGCAAC 
AGGGCTGTGTGGGGTAACCTGCCACCTTTAAGTGGAAATCAGAAATC 

ACCTGCTGAAATTAAGATCATCAGAGAAGCATATAAGAAGGCCTTTTTATTTGTTAACAAAGGTCTGAAT 
ACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACTATAAGCAAGGAATAGGACACCTGCTCAGAG 
GGATCAGCATTTCATCAAAAGAGTCTGAACAC^CAGGTCCT 

GAAAATGAAAGAAACTCTACAGAATGTACX3CACCAGGCTGGAAATTCTAGAGAAGGGTC 
CTGCAGAATGATCTTGAGGAGGTGCCCAAGTTATATCCAGAATTTCCACCTAAAGACATGTGTGAAAAAT 
TACCAGAGCCTCAGTCTTTTAGTTGAGCTCCTCAGCATGCTGAAGTAAATGGAAACACOT 
TGCAGGGGCAGTTGCTGCACCTGCTTCTCTGTCT 

GCTTATACTCCTC^AGCTGCTGAAGGTCACTACACTGTATCCTATGGAACAGATTCTGGGGAGTTTTCAT 
CAGTTGGAGAGGAGTTTTATAGGAATCATTCTCAGCCACCGCCTCnTGAG 

TGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTTTGTAAATCCTGCAGGGGAGGTTAGTGCACCT 

TCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATACGGTTCTAAACCGTCCTC 

CCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATOTCCGGTTCTGAAATGTAC 

TGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGCAGCAGGATGCTTTGTGGGGGTCGT 

TCTGAGTTACC^GAGGATGATAGAGAGCTCTTTGAGGATOTGTTAA 

AGGCCAACTGGAACAGAGCAGAAGAAGAAAATGAATTCCAAATCCCTGGAAGAACTAGACCCTCCTCTGA 
CCAACTAAAAGAAGCCTCTGGCACTGATGTGAAAC^ 

GGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGTTAACCTGAGTCACATTGTACCATGTG 

AGCCAGTTCCAGAAGAAAAGCCAAAAGAATTACCTGAATGGAGTGAA 
AGGTGCTTCCTGGGTGAGTTGGGGTTTAGTCAAAGGTGCTGA 

GCTTCTAAACTCCGAGAGCGGATTCAACCAGAAGAAAAACCCGTGGAAGTTAGTCCAGCTGTCACCAAGG 
GACTTTATATAGCGAAGCAAGCTACAGGAGGAGGAGCAAAAGTCAGT^ 
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Sc^GG^^TGGCCTAACTGCra 

SaagIScaI^^ 
^4aIgaaagataaatgatgaagtgctgggaatcactc^^ 

^AAA^AG^CAAA^GTGGAATTCCTCACAGATTAACCA 

J™^^; ttatggcatgtcttC tatttaaaaggaaaagaataagta 

GAAATGTGAAGTTATATTCTCAAGTTTATTTTTTTCCAAGTGTAGCTAAAA 

AAGCTGATAGTACATGTGTTGTTCAAACCTTGTTAAACCTAATATTGAACTATTTTTATATCTGCTCTCT 

TTCAGAAGGCAAATAGGAAACTATATATTTGCTTAAAAATTGGCATTTAGTAACCTTA , 

GAftGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCACTAATTGTGGATTTTTTTAGATGCTTCTCA 

TTTTCAGTGTGTAAGCTAAACAAAAACTAAAACTAAGAATTCTCAAAAAAACTTGTTCAAAACAGGGAAA 

GACTGATGAAAAGTAAAATGGACTACTTTTGTAACTTACCTGTTTGTT^ 

ATTTAAAATGAATAAAAATAGATTATTACGTCTTTTGTATTGAGACTGTATTGTTATGAGCCTAGGAAAT 
TTGGGAACATGATTGTATTGTATTAAAATTCGAAGTGATTATTATCAGCTTAATTGGATTAAAAAAGTAC 

TTCAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Huraan SPG20 mRNA sequence - var8 (public gi: 3043743) (SEQ JJNO: 374) 

GCX3GCCGCGCAGGGAGCTCTCGAGGCAACX3CCGGGGCGCCCGAGGTCTGGAAGGCGCAGASATGGAGCAA 

GAGCCACAAAATGGAGAACCTGCTGAAATTAAGATCATCAGAGAAGCATATAAGAAGGCCTTTTTATT 

TTAACAAAGGTCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACTATAAGCAAGGAAT 

AGGACACCTGCTCAGAGGGATCAGCATTTCATCAAAAGAGTCTGAACACACAGGTCCTGGGTCGGAATCT 

GCTAGACAGATGCAACAGAAAATGAAAGAAACTCTACAGAATGTACGCACCAGGCTGGAAA^raCTAGAGA 

AGGGTCTTGCCACTTCTCTGCAGAATGATCTTCAGGAGGTGCCCAAGTTATATCCAGAATTTCCACCTAA 

AGACATGTGTGAAAAATTACCAGAGCCTCAGTCTT^^ 

AACACCTCAACTCCAAGTGCAGGGGCAGTTGCTGCACCTGCTTCTCTGTCTrTTACCATCACAAAGTTGTC 

CAGCAGAAGCTCCTCCTGCTTATACTCCTCAAGCTGCTGAAGGTCACTACACTGTATCCTATGGAACAGA 

TTCTGGGGAGTTTTCATCAGTTGGAGAGGAGTTTTATAGGAATCATTCTCAGCCACCGCCTCTTGAGACC 

TTAGGGCTGGATGCAGATGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTTTGTAAATCCTGCAG 

GGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATAC 

GGTTCTAAACCGTCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATCT 

CCGGTTCTGAAATGTACTGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGGAGCAGGATGCTTTG 

TGGGGGTCGTCCTGTCCTCTGAGTTACCAGAGGATGATAGAGAGCTCTTTGAGGATCTGTTAAGGCAAAT 

GTCTGACCTTCGGCTCCAGGCCAACTGGAACAGAGCAGAAGAAGAAAATGAATTCCAAATCCCTGGAAGA 

ACTAGACCCTCCTCTGACCAACTAAAAGAAGCCTCTGGCACTGATGTGAAACAGTTGGACCAAGGCAATA 

AGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGTTAACCTGAG 

TCACATTGTACCATGTGAGCCAGTTCCAGAAGAAAAGCCAAAAGAATTACCTGAATGGAGTGAAAAAGTG 

GCTCACAACATTTTGTCAGGTGCTTCCTGGGTGAGTTGGGGTTTAGTGAAAGGTGCTGAGATTACTGGTA 

AGGCAATCCAGAAAGGTGCTTCTAAACTCCGAGAGCGGATTCAACCAGAAGAAAAACCCGTGGAAGTTAG 

TCCAGCTGTCACCAAGGGACTTTATATAGCGAAGCAAGCTACAGGAGGAGCAGCAAAAGTCAGTCAGTTC 

CTGGTTGATGGAGTTTGCACTGTAGCAAATTGCGTTGGAAAAGAACTAGCTCCACATGTCAAGAAGCATG 

GAAGCAAACTTGTTCCAGAATCTCTTAAAAAAGACAAAGATGGGAAATCTCCTCTGGATGGTGCTATGGT 

TGTAGCAGCAAGTAGTGTTCAAGGATTTTCAACTGTCTGGCAAGGATTGGAATGTGCAGCTAAATGCATC 

GTTAACAATGTTTCAGCAGAAACTGTACAAACTGTCAGATACAAATACGGATATAATGCAGGAGAAGCTA 

CCCACCATGCGGTGGATTCTGCGGTCAATGTTGGCGTAACTGCCTACAATATTAACAACATTGGTATCAA 

AGCAATGGTGAAGAAAACTGCAACACAAACAGGACACACTCTCCTTGAGGACTATCAGATAGTTGATAAT 

TCTCAGAGGGAAAATCAAGAAGGAGCAGCAAATGTCAACGTGAGAGGGGAGAAGGATGAGCAGACGAAGG 

AAGTAAAGGAGGCAAAGAAGAAAGATAAATGATGAAGTGCTGGGAATCACTTATACCAAAGCCTTATGAA 

ATGGATGAAATTTTGTTAAATAGGCAAATGTGGAATTCCTCACAGATTAACCAGTATTTTTTAAATGTAT 

TCATTCCTACAAATTAACTTTCATAAATTTTATGGCATGTCTTCTATTTAAAAGGAAAAGAATAAGTATT 

CTTGCATCTGGCCTTAGAAATGTGAAGTTATATTCTCAAGTTTATTTTTTTCCAAGTGTAGCTAAAATAT 

TTTTGCAGGTAAAATAAAGCTGATAGTACATGTGTTGTTCAAACCTTGTTAAACCTAATATTGAACTATT 

TTTATATCTGCTGTCTTTCAGAAGGCAAATAGGAAACTATATATTTGCTTAAAAATTGGCATTTAGTAAC 

CTTAATTCTTTTTATAGAAGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCACTAATTGTGGATTTTTT 

TAGATGCTTCTCAAAATTTTCAGTGTGTAAGCTAAACAAAAACTAAAACTAAGAATTCTCAAAAAAACTT 

GTTCAAAACAGGGAAAGACTGATGAAAAGTAAAATGGACTACTTTTGTAACTTACCTGTTTGTTAGGAAA 

TGGAATGGTCTCTTTGATTTAAAATGAATAAAAATAGATTATTACGTCTTTTGTATTGAGACTGTATTGT 

TATGAGCCTAGGAAATTTGGGAACATGATTGTATTGTATTAAAATTCGAAGTGATTATTATCAGCTTAAT 

TGGATTAAAAAAGTACTrCAAGAAATTATTTTATCATATCTGCTTCTGTTTTTCCAAAAGGTTAAAACTT 

GTAAAAAAAATATATATAAACAATTGAGTTTACTAATGGTAAACATTTTTATTCTGGGATTCGGTCATTG 
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GAATTTATATTAAAAGAG^ 

ACCTTAGACGTAATCCATGGTGGATAGGCATTATGGTTTCCACTTTG^ 
CCTATTTACTTACATAGGCTA^ 

ATTTAAGAGATTGGTATTAGTTTTCATAGCTGTAGTCCATTCT^ 

TTAATTAGACATTATTTGAAGCTGTCTGAAGAATGCACTTTATGAATTAAAAAACTGAATTGCCTGACCT 
. CGTTATCACATGAGCTTATATTTTGGGAAC^CATAGAACTGATGGAGGCTTTT 
TGTACTAGTTGTTAAAATGGAAATAAAAGTGAAGTGGTAAAT 

Human SPG20 protein sequence - varl (public gi: 28436885) (SEQ ID NO: 386) 

MEQEPQNGEPAE I KI I REAYKKAFLFVNKGLNTDELGQ KEEAKNYYKQG I GHLLRGI S I S SKESEHTGTG 
VTOSARQMQQKMKETI^NVRTRLEILEKGLATC^ 

WGNTSTPSAGAVAAPASLSLPSQSCPAE^PAYTPQAAEGHYTVSYGTDSGEFSSVGEEFYRNHSQPPP 
LETLGLDADELILI PNGVQI F FVNPAGEVSAPS YPGYLRI VRFLDNSLDTVLNRPPGFLQVCDWLYPLVP 
DRSPVLKCTAGAYMFPDTMLQAACTCFVGV^^ 
PGRTRPSSDQLKEASGTDVKQLDQGNKDVRH^ 

EKVAHNI LSGAS WVS WGLVKGAE I TGKAI QKGAS KLRER I QPEEKP VEVS PAVTKGLY I AKQ ATGGAAKV 
SQFLVDGVCIVANCVGKELAPHVKKHGSKLVPESLKK^ 

KCI VNNVSAETVQTVRYKYGYNAGEATHHAVDSAVNVGVTAYNINNI GI KAMVKKTATQTGHTLLEDYQ I 
VDNSQRENQEGAANVNVRGEKDEQTKEVKEAKKKDK 

Human SPG20 protein sequence- var2 (public gi: 22074832) (SEQ ID NO: 387) 

MEQEPQNGEPAEI KI IREAYKKAFLFVNKGLNTDEIjGQKEEAKNYYKQGIGHLLRGI S I SSKESEHTGPG 
WESARQMQQKMKETM^^ 

VNGNTSTPSAGAVAAPASLSLPSQSCPAEAPPAYTPQAAEGHYTVSYGTDSGEFSSVGEEFYRNHSQPPP 
LETLGLDADELILI PNGVQIFFVNPAGEVSAPSYPGYLRIVRFLDNSLDWLNRPPGFLQVCDWLYPLVP 
DRSPVLKCTAGAYMFPDTMLQAAGCFVGVVLSSELPEDDRELFEDLLRQMSDLRLQA^^ 
PGRTRPSSDQLKEASGTDVKQLIX5GNKDVRHKGKRGKRAKDTSS 

EKVAHNILSGASWSWGLVKGAEITGKAIQKGASKLRERIQPEEKPVEVSPAVTKGLYIAKQATGGA^ 
SQFLVDOTCTVANCVGKELAPHVKKHGSKLVPESLKKD 

KCI VNNVS AETVQTVRYKYGYNAGEATHHAVDS AVNVGVTAYNINNI GI KAMVKKTATQTGHTLLEDYQI 
VDNSQRENQEGAANVNVRGEKDEQTKEVKEAKKKDK 

Human SPG20 protein sequence - var3 (pubHc gi: 3043744) (SEQ ID NO: 388) 

RPRRELSRQRRGARGLEGAEMEQEPQNGE PAEI KI I REAYKKAFLFVNKGLNTDELGQ KEEAKNYYKQGI 
GHLLRGI S I S S KES EHTGPGWES ARQMQQKMKETLQNVRTRLE I LEKGLATSLQNDLQEVPKLYPEFP PK 
DMCEKLPEPQSFSSAPQHAEVNGNTSTPSAGAVAAPASLSLPSQSCPAEAPPAYTPQAAEGHYTVSYGTD 
SGEFSSVGEEFYRNHSQPPPLETLGLDADELI LI PNGVQI FFVNPAGEVS APS YPGYLRI VRFLDNSLDT 
VLNRPPGFLQVCTV^YPLVPDRSPVLKCTAGAYMFPDTMLQAAGCFVGWLSSELPEDDRELF 
SDLRLQANWNRAE EENEFQ I PGRTRPSSDQLKEASGTDVKQLDQGNKDVRHKGKRGKRAKDTSSE EVNLS 
HIVPCEPVPEEKPKELPEWSEKVAHNILSGASWVSWGLVKGAEITGKAIQKGASKLRERIQPEEKPVEVS 
PAVTKGLYI AXQATGGAAKVSQFLVIX^CTVANCVGKEIjAPHVKKHGSKLVPES LKKDKDGKS PLDGAMV 
VAASSVQGFSTWQGLECAAKCIVNISrVSAETVQ 

AMVKKTATQTGHTLLEDYQIVDNSQRE^QEGAANVOTRGEKDEQTKEVKEA 

Unigene Name: WASF1 Unigene ID: Hs. 75850 

Human WASF1 mRNA sequence - varl (public gi: 4507912) (SEQ ID NO: 375) 

CTTCTCTTGCACTTGCGGATGATGAACTGGAATAACGATO 

GTCCTGCCCTATAACCGATTAATTAATTGATCCCCAGCTAGACTAGTGTTGGAGAAATCAGCATGTTAAA 

ACAACTGTTGATGATAGCTGTTGGAGTAAAGTTGCAGTGGAAGCTATGGCTGCAA^ 

CAAGGTGAACTGGCACAAAGGTTAATCTCAAGATGCCGCTAGTGAAAAG 

GTGCCACAC^GCACTGCCTAGAGGCATTAAGAATGAACTGGAATGTGTAACCAATATTTCCTTGGCT^T 

ATAATTAGACAACTAAGTAGCCTAAGTAAATATGCTGAAGATATATTTGGAGAATTAT^ 

ATAGTTTTTCCTTGAGAGTCAACTCATTGGAAGAACGTGTGGACCGTTTATCTGTTAGT 

TGATCCAAAGGAAGAAGAATTGTCTTTGC^GATATAACAATGAGGAAAGCTTTCCGA^ 

CAAGACCAGCAGCTTTTCGATCGCAAGACTTTGCCTATTCCATTACAGG 

AGCCTCCACCTCTCAATATACTCACTCCTTATAGAGATGATGGTAAAGAAGGTCTGAAGTTTTATACCAA 
TCCTTCGTATTTOTTTGATCTATGGAAAGAAAAAATGTTGCAAGATACAGAGGATAAGAGGAAGGAAAAG 
AGGAAGCAGAAGCAGAAAAATCTAGATC^TCCTCATGAACGAGAAAAAGTG 

GGCGGCGAGAATGGCAGAAGCTGGCCCAAGGTCCAGAGCTGGCTGAAGATGATGCTAATCTCTTACATAA 
GCATATTGAAGTTGCTAATGGCCCAGCCTCTCATTTTGAAA^^ 
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GATGGATCTTACTCACTTTCTGCCTTGCCATTTAGTCAGATGAGTQAGCTTCTGACTAGAGCTGAGGAAA 
ACCCACCTGTATCAGTTCTGCTACAGGTTTGATAGAAAATCGCCC 

GGAAGCTCCA^GTACCACCACCTCCAGCTCCTCTTCAGATTGCCCCTGGA^ 

TACCAGTTCATCCACTCCCACAAGGTGAAGTTCAGGGGCTGCCTCCACCCCCACCA 

TCCACCTGGCATTCGACCATCATCACCTGTCACAGTTAC^GCTCr^ 

cSScCA^ACTGCCCCAGGTC^ 

c^cSaSc^cgccatccatcaaccctacctgtaatcagtgatg^^ 

AGCAATACGAAAAGGTATTCAGCTACGCAAAGTAGAAGAGCAGCGTGAACAGGAAGCTAAGCATGAACGC 

m^gaaaaog^^ 

CAGAA^TTGATGAAGTAGATTGGTTGGAGTAAGAAAAATGCATTGATAAATATTACAAAACTG 

mgtcctctgtcmtgcttgttcct^ 

TAAATGACCCTTTTCCTCCATAACTTTTGATTTCTAAGGAAAA^ 
TTTACAGTGG CTTATC 



CATGGACAGCTGTTGTTAGAGTAGCAGATTCAGTTTTTTGATATATC^ 

taatttaaagjLuvgc 

TCCTGATGTOTTAAAAGAAGAAACACTGCCTTGATTATACGAATAC^ 

GTTAAAAACTTTTCCATGTGAAATACTCTGACTTAAACATACATGTAACTTACATAACTGTTAAGAAT^ 
CAGTCTGATTTAATAAATGGTTCATTTTAAAAGTT 

Human WASF1 mRNA sequence - var2 (Public gi: 4927209) (SEQro NO: 376) 

ATGCCGCTAGTGAAA^GAAACATCGATCCTAGGCACTTGTGCCACACAG CACTGCCTAGAGG ^^^^5^ 

'atgaactggaatgtgtaaccaatatttccttggcaaatataattagacaactaagtagc 

TGCTGAAGATATATTTGGAGAATTATTCAATGAAGGACATAGTTTTTCCTTCAGAGTC^UV 

GAACGTGTGGACCGTTTATCTGTTAGTGTTACAC^GCTTGATCCAAAGGAAGAAG^ 
ATATAACAA.TGAGGAAAGCTTTCCGAAGTTCTACAATTCAAGACCAGCAGCTTTTCGATCGCAAGACT 

GCCTATTCCATTACAGGAGACGTACGATGTTTGTGAACA 
AGAGATGATGGTAAAGAAGGTCTGAAGTTTTA^ 

AAATGTTGCAAGATACAGAGGATAAGAGGAAGGAAAAGAGGAAGCAGAAGCAGAAAAATCTAGATCGTCC 

TCATGAACCAGAAAAAGTGCCAAGAGCACCTGATGACAGGCGGCGAGAATGGCAGAAGCT^ 

CCAGAGCTGGCTGAAGATGATGCTAATCTCTTACATAAGCATATTGAAGTTGCTAATGGCCC^GCCTCTC 

ATTTTGAAACAAGACCTCAGACATACGTGGATCATATGGA'TGGATCTTACTCACTTTCTGC^ 

TAGTCAGATGAGTGAGCTTCTGACTAGAGCTGAGGAAAGGGTATTAGTCAGACCAC^TGAACCACCTCCA 

CCTCCACCAATGCATGGAGCAGGAGATGCAAAACCGATACCCACCTGTATCAGTTCTGCTACAGGTTTGA 

TAGAAAATCGCCCTCAGTCACCAGCTACAGGCAGAACACCTGTGTTTGTGAGCCCCACTCCCCCACCTCC 

TCCACCACCTCTTCCATCTGCCTTGTCAACTTCCTCATTAAGAGCTTCAATGACTTC^CTCCTCCCCCT 

CCAGTACCTCCCCCACCTCC^CCTCCAGC^ 
CTCTTCAGATTGCCCCTGGAGTTCTTCACCCAGCTCCT^ 

TCCACCAGTAGCTAGAGCTGCCCCAGTATGTGAGACTGTACCAGTTCATCCACTCCCACAAGGTGAAGTT 
CAGGGGCTGCCTCCACCCCCACCACCGCCTCCTCTGCCTCCACCTGGCATTCGACCATCATCACCTGTCA 
CAGTTACAGCTCTTGCTCATCCTCCCTCTGGGCTACATCCAACTCCATCTACTGCCCCAGGTCCCCATG^ 

TCCATTAATGCCTCCATCTCCTCCATCACAA 

CTACCTGTAATCAGTGATGCCAGGAGTGTGCTACTGGAAGCAATACGAAAAGGTATTCAGCTACGCAAAG 
TAGAAGAGCAGCGTGAACAGGAAGCTAAGC^TGAACGCATTGAAAACGATGTTGCCACC^TCCTGTCTCG 
CCGTATTGCTGTTGAATATAGTGATTCGGAAGATGATTCAGAATTTGATGAAGTAGATTGGTTGGAGTAA 

GAAAAATGCATTGATAAATATTAC^AAACTGAATGCAAATGTCCTO^ 
TTTGGTCA 

Human WASF1 protein sequence - varl (public gi: 4507913) (SEQ ID NO: 389) 

MPLVKRNIDPRHLCOTALPRGIKNELECVTNIS 

ERVDRLSVSVTQLDPKEEELSLQDITMRKAFRSSTIQDQQLPDRKTLPIPIiQETYD 

RDDGKEGLKFYTNPSYFFDLWKEKMIjQDTEDKRKEKRKQKQ 

PELAEDDANIiliHKHIEVANGPASHFETRPQTYVDHMDGSYSLSAIiPFSQM 

PPPMHGAGDAKPIPTCXSSATGLIEimPQSPATGRTPVFVSPTPPPPPPPI.PSAIiSTSSIiRASMTSTPPP 
PVPPPPPPPATAWAPAVPPPPAPLQIAPGVLHPAPPPIAPPLVQPSPPVARAAPVCETVPVHPLPQGE^ 
QGLPPPPPPPPLPPPGIRPSSPVTVTALAHPPSGLHPTPSTAPGPHVPLMPPSPPSQVIPASEPKRHPST 
LPVISDARSVLLEAIRKGIQLRKVEEQREQEAICHERIENDVATILSRRIAVEYSDSEDDSEFDEVDWLE 
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Unigene 'Name; HIP-55 Unigene IP: Hs.183373 
.Human tDP-55 mRNA sequence - varl (pubHc gi: 6470260) (SEQ ID NO: 377) 

ATGGCGGCGAACCTGAGCCGGAACGGGCCAGCGCTGCAAGAGGCCTACGTGCGGGTGGTCACCGAGAAGT 
^^jCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAATGACATCCGCGTGGCTGGCACAGGGGA 
-^GGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGTGATGTACGCCTTCTG^GAG 
GACCCCAACTCTGGACTGCCCAAATTTGTCCT 
AGGGAGCCTGTGCCAGCC^CGTCAGCACCATGGCCAGCT 

ACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGCCAAGGCTTCAGGTGCCAACTAC 
AGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCC 

AGACCAATGCCGTGTCTGAGATTAA7^GGGTTGGTAAAGACAGCTTCTGGGCCAAAGCAGAGAAGGAGGA 

GGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAGGCAC^GCGGCAGCTGGAGCAGGAGCGCCGG 

GAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTATCAGGAGCAGGGTGGCGAGGCCAGCCCCC 

AGAGGACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAGGAACCGAAATGAGCAGGAGTCTGCCGTGCACCC 

GAGGGAGATTTTGAAGCAGAAGGAGAGGGCCATGTCCACGACCT 

CTGAGGAGCCCCTTCCTOCAGAAGCAGCTCACCCAAC 

CCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGCACTCCTCCATGTCTGGTGCA 
GGCAGAAGAGGAGGCTGTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGCAGCCCCCACTGGTG 
CAGCAGCAAGGTGCCGGCTCTGAGCACATTGACCACCAC^^ 

TCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGACGACACAGAGATCTCCTTTGACCCCGAGAACCT 
CATCACGGGCATCGAGGTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCATG 
TTCCCTGCCAACTACGTGGAGCTCATTGAGTGAGGCTGAGGGCGGCCGCTAGACTAGTCTAGAGAAAAAA 



Human HIP-55 raKNA sequence - var2 (public gi: 8885629) (SEQ ID NO: 378) 

GAAGCTAC^GCAGCGGCGCGGAGACTGCGGGGCGGGCCATGGCGGCGAACCTGAGCCGGAACGGGCCAGC 

GCTGCAAGAGGCCTACGTGCGGGTGGTCACCGAGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAA 

GGC^U\.CAGCAATGACATCCGCGTGGCTGGCACAGGGGAGGGTGGCCTGGAGGAGATGGTGGAGGAGCTCA 

ACAGCGGGAAGGTGATGTACGCCTTCTGCAGAGTGAAGG 

CATCAACTGGACAGGCGAGGGCGTGAACGATGTGCGGAAGGGAGCCTC 

GCCAGCTTCCTGAAGGGGGCCCATGTGACC^^ 

TCATGGAGAAGGTGGCCAAGGCTTCAGGTGCCAACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGA 

CGTGGGACCCCAGGCCCCAGTGGGCTCTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTT 

GGTAAAGACAGCTTCTGGGCCAAAGCAGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGG 

CCGAGGAGGCACAGCGGCAGCTGGAGCAGGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGA 

GCAGCGCTATCAGGAGCA.GGGTGGCGAGGCCAGCCCCCAGAGGACGTGGGAGCAGCAGCAAGAAGTGGTT 

TGW^GGAACCGAAATGAGCAGGAGTCTGCCGTGCACC^ 

TGTCCACCACCTCCATCTCGAGTCCTCAGCCTGGC2^ 

CCAACCAGAGACCCACTTTGGCAGAGAGC<^GCTGCTGCCATCTCAAGGCCCAGGGCAGATCTCCCTGCT 
GAGGAGCCGGCGCCCAGCACTCCTCCATGTCTGGTGCAGGCAGAAGAGGAGGCTGTGTATGAGGAACCTC 
CAGAGCAGGAGACCTTCTACGAGCAGCCCCCACTGGTGCAGGM 

CCACCACATTCAGGGCCAGGGGCTCAGTGGGCAAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCA 
GCCGACGACACAGAGATCTCCTTTGACCCCGAGAACCTCATCACGGGCATCGAGGTGATCGACGAAGGCT 
GGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCATGTTCCCTGCCAACTACGTGGAGCT 
AGGCTGAGGGCACATCTTGCCCTTCCCCTCTCAGAC^ 

AGTTG ACATT CAGCACTCTTCCAGGAATAGGAC CCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGCT 
TGGCAGACTCAGCCTGTCACCCGAAATGCAGCAATGGCCTG^^ 

CCGACCCTCCCAGACAGCTTGGCTCTTGCCCCTGACAGGATACTGAGCCAAGCCCTGCCTGTGGCCAAGC 

CCTGAGTGGCCACTGCCAAGCTGCGGGGAAGGGTCCT^ 

CTGC^TTTATTTGCCTTTTTTCTTTTTCTC 

CCTTGGGAACAGTGAACGTAGAGAATTGTTTTTAGCAGAGTTTGTGACCA 
GGTTTGGCAGCAGGGAATTTGTCTTGTTGGAGCCTGCTCTGTGCTCCCCA 

GCCTGGGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATTCAAAAACGGCAGACAG 
AACATGTTCCTCCACGCGGCTCAAAAAAAAAAAAAAAAAAA 

Human HIP-55 mRNA sequence - var3 (public gi: 8917572) (SEQ ID NO: 379) 

ATGGCGGCGAACCTGAGCCGGAACGGGCCAGCGCTGK^AAGAGGCCTACGTGCGGGTGGTCACCGAGAAGT 
CCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAATGACATCCGCGTGGCTGGCACAGGGGA 
GGGTGGCCTGGAGGAGATGG^GGAGGAGCTCAACAGCGGGAAGGTGATGTACGCCTTCTGCAGAGTGAAG 
GACCCCAACTCTGGACTGCCCAAATTTGTTCTCATCAACTGGACAGGCGAGGGCGTGAACGATGTGCGGA 
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AC^GAGCCTGTTCCAGCCACGTCAGCACCATGGCCAGCTTCCTGAAGGGGGCCCATGTGACCATCAACGC 
ACGGGCcSgGAGGATGTGGAGCCTGAGTGCAT^ 

AGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGGGCTCTGTGTACCAGA 
AGACCAAXG^GTC^^ 

^gaa^Sc^^ 

gaS?otgagSgcgtgaggctgcacgccgggagcagcgctatcaggagcagggtg 

aSSStgggagcagcagcaagaagtggtttcaa^ 

gag^gagattotcajScagaaggagagg^ 

CTGAGSSc^ra 

ccaJctcaaSc^ 

ggcaSagaggasgctgtctatgaggaacctccrgagcaggagaccttcta^ 
cagcagcaaggtgctggctctgagcacattgaccaccacattcagggccaggggctcagtgggcaagggc 

^^gtoccctgta^^ 

catcacgggcatcgaggtgatcgacgaaggctggtggcgtggctatgggccggatggccattttggcatg 
ttccctgccaactacgtggagctcattgagtga 

Human HIP-55 lnRNA sequence - var4 (public gi: 10121214) (SEQ ID NO: 380) 

GGGGCGGGCCATGGCGGCGAACCTGAGCC^ 

accgagaagtccccgaccgactgggctctctttacctatgaaggcaacagcaatgacatc 
.ggac^gccgg^gcgtgagctgcgtgaggctgcacgccggga^^ 

gccagcccccagagtacgtgggagcagcagcaagaagtggtttcaaggaaccgaaatgagcaggagtctg 

gcctggcaagctgaggagccccttcctgcagaagcagctcacccaaccagagacccactttggcagagag 

ccSc?g^gcgatctcaaggccc^ 
g^StSgSgaagaggag^ 

cccactggtgcagcagcaaggtgctggctctgagcacattgaccaccacattcagggccag^ 

ggg^gggctctgtgcccgtgccctgtacgactaccaggcagccgacgacacagagatctcctttgacc 

SgaSaa^StcIcgggcatcgaggtgatcgacgaaggctggtggcgtggctatgggccg^ 

TTTTGGCATGTTCCCTGCCAACTACGTGGAGCTC^ 

TCTCAGA^ATGGCTTCCTTATTGCTGGAAGAGGAGGCCTGGGAGTTGACATTCAGCACTCT 

AGGACCCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGCTTG^ 

CAgHaATOGCCTGGTC^TC^ 

CCCCTGACAG^ATACTGAGCCAAGCCCTGCCTGTGGCCAAGCCCTGAGTGGCC^ 
AAGGGTCCTGAGCAGGGGCATCTGGGAGGCTCTGGCTGCCTTCTGCATTTAm 
^TCCTTCTAAGGGGTGGTGGCCACCACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAG^ 

S??SagSgtgaccaaagtcagagtggatca TC 
gSgcctgctctgtgctccc^ctccatttctctgtccctctgcctgggctatgg^ 

atggccaagctcccaccctgggtattcaaaaacggcagacacaac^^ 

tgcctgcaggccccagtgtgtgcctcaactgattctgacttcagga 

ctgttgtcttcccctattttctgtccc^gctcatccgtgtctctgaagaataaatatgct 

aaaaaaaaaa 

HiunanHIP-55 01RNA sequence - var5 (public gi: i044i969) (SEQ ID 'NO: 381) 
c^ccatcaacgcacgggccgaggaggatgtggagcctgagtgcatcatggagaaggtggccaaggcttc^ 

ggtgccaaotacagctttcacaaggagagtggccgcttc^ 

ctgtgtaccagaagaccaatgccgtgtctgagattaaaagggttggtaaagacagcttctgggccaa^ 

agagaaggaggaggagaaccgtcggctggaggaaaagcggcgggccgaggaggcacagcggcagc^ 

caggagcgccgggagcgtgagctgcgtgaggctgcacgccgggagcagcgctatcaggagcagggtggcg 

AGGCCAGCCCCCAAAGGACGTGGGAGCAGCAGCAAGAAGTGGTTrCAAGGAACCGAAATGAG^ 
TGCCGTGCACCCGAGGGAGATTTTCAAGCAGAAGGAGA 

C^GCCTGGCAAGCTGAGGAGCCCCTTCCTGCAGAAGCAGCTCACCCAACCAGAGACCC^CTTTGGCAGAG 
AGCCAGCTGCTGCCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGCACTCCTCC 

atgtctggtgcaggcagaagaggaggctgtgtatgaggaacctccagagcaggagaccttctacgagcag 

CCC^CACTGGTCCAGCAGCAAGGTGCTGGCT^ 

GTGGGCAAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGACGACACAGAGATCTCCTTTGA 



Figure 36 part - 153 



P C T / U S O 4» / O B 3 O S 
Scaatcgcct^^^^^ 

TGCCCCTGACAGGATACTGAGCCAAGCCCTGCCTGTC 
GGARGGGTCCTCAGCAGGGGCATCTGGGAGGCTCTGGCTGCCTTCT 

^CtcSgCTTCTAAGGGGTGGTGGCCACCACTGTT^^ 

?St™gcagagtttgtgaccaaagtcagag^ 

^gIg^tgctctgtgctccccactccatttctctgtccctctgcctgggctatgggaagtg^ 
aIatggccaagctcccaccctgggtattcaaaaacg^ 

gatgcctgcaggccccagtgtgtgcctcaactgattctgacttc^ggaaaagtaacacagagtggcot 
gcctgttgtcttcccctattttctgtccc^gctcatccgtgtctctgaagaacaaatatgcttttggacc 

acgaaaaaaaaaaaaaaaaaaaaa 

Human HIP-55 mRNA sequence - var6 (public gi: 14 04199s) (SEQ ID NO: 382) 

AGCGGCGCGGAGACTGCGGGGCGGGCCATGGCGGCGAACCTGAGCCGGAACGGGCC^GCGCTGCAAGAGG 
CCTACGTGCGGGTGGTCACCGAGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAA 
TGACATCCGCGTGGCTGGCACAGGGGAGGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAG 

GTGATGTACGCCTTCTGCAGAGTGAAGGACCCCA 

CAGGCGAGGGCGTGAACGATGTGCGGAAGGGAGCCTGTGCCAGCCACGTCAGCACCATGGCCAG^ 

GAAGGGGGCCCATGTGACCATCAACGCACGGGCCGAC^AGGATGTGGAGCCT^ 
GTGGCCAAGGCTTCAGGTGCCAACTAC^GCTTCCACAAGGAGAGTGGCCGCTTCCAGGACGTCGGAC^ 
AGGCCCCAGTGGGCXCTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAG 
CTTCTGGGCCAAAGCAGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAGGCA 

AGGAGCAGGGTGGCGAGGCCAGCCCCCAGAGCAGGACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAGGAA 
'CCGAAATGAGCAGGGGTCAACATGTGCTTCCCTCCAGGAGTCTGCCGTGCACCCGAGGGAGATTTTCAAG 

TOCAGAAGCAGCTCACCCAACCAGAGACCCACTTTGGCAGAGAGCCAGCTGCTGCCATCTCAAGGCCCAG 

SSgatcScctgctgaggagccggcgcccagcactcctccatgtctggtgcaggcagaagaggaggct 

ctSatoaggaaStccagagcaggagaccttctacgagcagcccccactggtgcagcag^ 

gctctgagcacattgaccaccacatccagggccaggggctcagtgggcaagggctctgtgcccgtgccct 

gtgatcgacc^ggctggtggcgtggctatgggcccmatggccattttggcatgttccctgccaactacg 
tggagctcattgagtgaggctgagggcacatcttgcccttcccctctcagacatggc^ccttattgctg 
gaagaggaggcctgggagttgacattcagcactcttccaggaataggaccccc^gtgaggatgaggcctc 
agggctccctccggcttggcagactcagcctgtcaccccaaatgcagcaatggcctggtgattcccacac 
atccttcctgcatcccccgaccctcccagacagcttggctcttgcccctgacaggatactgagccaagcc 
ctgcctgtggccaagccctgagtggccactgccaagctgcggggaagggtcctgagcaggggcatctggg 
aggctctggctgccttctgcatttatttgccttttttctttttctcttgcttctaaggggtggtggccac 
cactgtttagaatgacccttgggaacagtgaacgtagagaattgtttttagcagagtttgtgaccaaagt 
cagagtggatcatggtggtttggcagcagggaatttgtcttgttggagcctgctctgtgctccccactcc 
atttctctctccctctgcctgggctatgggaagtggggatgcagatggccaagctcccaccctgggtatt 
caaaaacggcagacacaacatgttcctccacgcggctcactcgatgcctgcaggccccagtgtgtgcctc 

aaccgattctgacttcaggaaaagtaacacagagtggc 

Human HIP-55 mRNA sequence - var7 (public gi: 15079722) (SEQ ID NO: 383) 
ggcacgagggcggagactgcggggcgggccatggcggcgaacctgagccggaacgggccagcgctgcaag 

AGGCCTACGTGCGGGTGGTCACCGAGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAG 

caatgacatccgcgtggctggcacaggggagggtggcctggaggagatggtggaggagctcaacagcggg 

AAGGTGATGTACGCCTTCTGCAGAGTGAAGGACCCCAACTCTGGACTGCCCAAATTTGTCCTCATCAACT 
GGACAGGCGAGGGCGTGAACGATGTGCGGAAGGGAGCCTGTGCCAGCCACGTCAGCACCATGGCCAGCTT 
CCTGAAGGGGGCCCATGTGACCATCAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAG 
AAGGTGGCCAAGGCTTCAGGTGCCAACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGAC 
CCCAGGCCCCAGTGGGCTCTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGA 
CAGCTTCTGGGCCAAAGCAGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAG 
GCACAGCGGCAGCTGGAGCAGGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCT 
ATCAGGAGCAGGGTGGCGAGGCCAGCCCCCAGAGCAGGACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAG 
GAACCGAAATGAGCAGGAGTCTGCCGTGCACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCC 
ACCACCTCCATCTCCAGTCCTCAGCCTGGCAAGCTGAGGAGCCCCTTCCTGCAGAAGCAGCTCACCCAAC 
CAGAGACCCACTTTGGCAGAGAGCCAGCTGCTGCCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGA 
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GCCGGCGCCCAGCACTCCTCCAT6TCTGGTGCAGGCAGAAGAGGAGGCTGTGTATGAGGAACCTCCAGAG 
CAGGAGACCTTCTACGAGCAGCCCCCACTGGTGCAGCAGCARGGTGCTGGCTCTGAGCACATTGACCACC 
ACATTCAGGGCCAGGGGCTCAGTGGGCAAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGA 
CGACACAGAGATCTCCITTGACCCCGAGAACCTCATCACGGGCATCGAGGTGATCGACGAAGGCTGGTGG 
CGTGGOTATGGGCCGGATGGCCA3TTTGGCATGTTCCCTGCCAACrACGTGGAGCTCATTGAGTGAGGCT 
GAGGGCAGATCTTGCCCTTCCCCTCTCAGACATGGCTTCCTTATTGCTGGAAGAGGAGGCCTGGGAGTTG 
ACATTGAGCACTCTTCCAGGAATAGGACCCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGCTTGGCA 
^ACTCAGCCTOTCACCCCAAATGCAGCAATGGCCTGGTGATTCCCACACATCCTTCCTGCATCCCCCGAC 
CCTCCCAGACAGCTTGGCTCTTGCCCCTGACAGGATACTGAGCCAAGCCCTGCCTGTGGCCAAGCCCTGA 
GTGGCCACTGCCAAGOTGCGGGGAAGGGTCCTGAGCAGGGGCATCTGGGAGGCTCTGGCTGC^ 
TTTATTTGCCTTTTTTCTTTTTCrOTTGCTTCTAAGGGGTGGTGGCCACCACTGTTTAGAATGACCCTTG 
GGAACAGTGAACGTAGAGAATTGTTTTTAGCAGAGTTTGTGACCAAAGTCAGAGTGGATCATGGTGGTTT 
GGCAGCAGGGAATTTGTCTTGTTGGAGCCTGCTCTGTGCTCCCCACTCCATTTCTCTGTCCCTCTGCCTG 
GGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATTCAAAAACGGCAGACACAACAT 
GTTCCTCCACGCGGCTCACTCGATGCCTGCAGGCCCCAGTGTGTGCCTCAACTGATTCTGACTTCAGGAA 
AAGTAACACAGAGTGGCCTTGGCCTGTTGTCTTCCCCTATTTTCTGTCCCAGCTCATCCGTGTCTCTGAA 
GAACAAATATG CTT TTGGACCACGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Hmnan MP-55 mRNA sequence - var8 (public gi: 21619482) (SEQ ID NO: 384) 

CGGGCCATGGCGGCGAACCTGAGCCGGAACGGGCCAGCGCTGCAAGAGGCCTACGTGCGGGTGGTCACCG 
AGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGC^CAGCAATGACATCCGCGTCGCTGGCAC 
AGGGGAGGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGTGATGTACGCCTTCTGCAGA 
GTGAAGGACCCCAACTCTGGACTGCCCAAATTTGTCCTCATCAACTGGACAGGCGAGGGCGTGAACGATG 
TGCGGAAGGGAGCCTGTGCCAGCCACGTCAGCACCATGGCCAGCTTCCTGAAGGGGGCCCATGTGACCAT 
CAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGCCAAGGCTTCAGGTGCC 
AACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGGGCTCTGTGT 
ACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTTCTGGGCCAAAGCAGAGAA 

'ggaggaggagaaccgtcggctggaggaaaagcggcgggccgaggaggcacagcggcagctggagcaggag 

CGCCGGGAGCX3TGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTATCAGGAGCAGGGTGGCGAGGCCA 

gcccccagaggacgtgggagcagcagcaagaagtggtttcaaggaaccgaaatgagcaggagtctgccgt 

GCACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCCACCACCTCCATCTCCAGTCCTCAGCCT 

ggcaagctgaggagccccttcctgcagaagcagctcacccaaccagagacccactttggcagagagccag 

CTGCTGCCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGCACTCCTCCATGTCT 

ggtgcaggcagaagaggaggctgtgtatgaggaacctccagagcaggagaccttctacgagcagccccca 

CTGGTGCAGCAGCAAGGTGCTGGCTCTGAGCACATTGACCACCACATTCAGGGCCAGGGGCTCAGTGGGC 
AAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGACGACACAGAGATCTCCTTTGACCCCGA 
GAACCTCATCACGGGCATCGAGGTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTT 
GGCATGTTCCCTGCCAACTACGTGGAGCTCATTGAGTGAGGCTGAGGGCACATCTTGCCCTTCCCCTCTC 
AGACATGGCTTCCTTATTGCTGGAAGAGGAGGCCTGGGAGTTGACATTCAGCACTCTTCCAGGAATAGGA 
CCCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGCTTGGCAGACTCAGCCTGTCACCCCAAATGCAGC 
AATGGCCTGGTGATTCCCACACATCCTTCCTGCATCCCCCGACCCTCCCAGACAGCTTGGCTCTTGCCCC 
TGACAGGATACTGAGCCAAGCCCTGCCTGTGGCCAAGCCCTGAGTGGCCACTGCCAAGCTGCGGGGAAGG 
GTCCTGAGCAGGGGCATTTGGGAGGCTCTGGCTGCCTTCTGCATTTATTTGCCTTTTTTCTTTTTCTCTT 
GCTTCTAAGGGGTGGTGGCCACCACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAGAATTGTTTT 
TAGCAGAGTTTGTGACCAAAGTCAGAGTGGATCATGGTGGTTTGGCAGCAGGGAATTTGTCTTGTTGGAG 
CCTGCTCTGTGCTCCCCACTCCATTTTTCTGTCCCTCTGCCTGGGCTGTGGGAAGTGGGGATGCAGATGG 
CCAAGCTCCCCCCCTGGGTATTCAAAAACGGCAGACACAACATGTTCCTCCACGCGGCTCACTCGATGCC 
TGCAGGCCCCAGTGTGTGCCTCAACTGATTCTGACTTCAGGAAAAGTAACACAGAGTGGCCTTGGCCTGT 
TGTCTTCCCCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human HIP-55 mRNA sequence - var9 (public gi: 23959038) (SEQ ID NO: 385) 

GGCACGAGGATTTGACACATGAATGTATAGCAGTCATTGGGAAACTCCACAGCTCATGTTTTCCTCATAG 
TAGATGTGTGCTCCCATCTCCATGGCTTTGTCCCTCAGAACCCCCACCCCATGGTAAGTCAGGCCAGTGT 
CCTCCCAGCTGCAGAGCTGAGAAGGCTGCACAGTTGCCTACTGAGAACCTGCCTAGTGGGTCAGAGCAAA 
GTGAGAACGX3GCCTGTGCCCACCCACAGTGTTACTGTCAGCCCAAGCTCTTTGGGATGTAGTGGAAAGTC 
ATGGTGGATACGGTGAGGAGAGATGGAACCCAAGGTGCTGGCTACAGAGCTCACTTGTGTTTCGTTTCAG 
GGCTCTCTTTACCTATGAAGGCAACAGCAATGACATCCGCGTGGCTGGCACAGGGGGTGAGTATGACTCC 
AAATGGACTCAGGGACACCAGGAGGTAGGAGGGTGACGACGAGGGGTCAGCGCACTCAGCTGTCTTGGTC 
CACTGAGCCACATGGGGCTTCCAGTGCTCACTGGCCACTTCTGGCAGGCCTTAGGTTTCAGATATGTGTA 
AGTGAAAACATTCCTCCTTGGTTCTCCTTCCCTCTGGGTGCAGGGGAGTGCTTTCTCTTTGTCTACTTGG 
GGAGAGCTGAGAGGGAAACAGGCCTCTCCAGCTTGTGGGCAGCCTGCGTTGGGAGCTGCGGTGGGAAGCT 
CACCAGTCCCAGAACTGGTGCTGGTGGAAAGAAAGTCCACAGACATATCTTCTTCTCCCTTTGTCCTGCC 
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CTGGTCTTGTGCCGAGTGCTTGCAGGGGCCCCATCCTCACTGGGAGAGGCAGT 

CGGGGGAGGCTCTGGAGGTCTCTACAGGAAGGACAGGCTOTTGGCCAGCACAGAGCAGAGGTTGTCAGG^ 
TAGGCTTCGTCAGAGTGTGACCTGTGGGCCCCTCAGCTGACACCCGTGACTGCTCCTCCTCCAGAAGTTG 
CCTGACCCCTCCCTCTGTCCTGAGCTGGACATGGCTTCATTGTTCAATGAACACTCGGAGTGGTTCTCCA 
CGGTTTGATGTCTGTTGTTGGTAGAAAGCCCCTTCCTTTCACAATCTTTCTGGGAGGTGTCCCCTTTCTA 
GAAGGATTGCCATTGAACAGTAGAGATGTGGTGTGGCAGGTGACTGGGAGTTGCAGAGATCAACAACTTG 
AGAGTTTCCTGTCATCCCCAGTGGCACAGGACAGGGCTCTC 

AGTGGGGCTCATGACTGCCTCCACTCATACGGAGCCCTGTAGATGAAATACCTGATCAGCTCTTCCTCCT 

TATAACCTGGAAAAGTTTGTGAQSGCTAAGCCTCAGTGTC^GGGAGAATTGTTTAGAGCTGCCCACTCCT 

GTGCTCCCCCTGTCCCCAT CACCCTCTCnTCTGGAGTCTGAGGACTGAG CCAGTTACGCCACTGCAGGAT 

GTTCAATCTGGTCTGGCCGTCTGGGTGGCCCTGGAACTTGAGCAGAC^C^GGTGCAGGCAGTGGTGACTC 

TACAGGCCCTGCTATTCCGGGCCCTTTTGCAACGTTGTGGCAACAATAAAATTTTGACGTAGC 

CATTTGGAAGTCTGGTGGCTGGTTTGCCGTGGAAATGACCCTGTTTTTATTTCCAGAATTACCTCTGGGT 

TTAGAGAAGTGGTTTTTAAACGAGTGTGGGTAAAAAAATTACCTGAGGTACTTGTCAGAATCGCAGACTT 

CTAGGTCCCACCCAGCTCTCATCAATCAGTTTAGTGAGGGTC 

CCCTAGAAAGATTCTGATACAGGTAGAGGTGAGAAGCCCTGGTTTAGAAG^GCTCGGCCTCCCTTCATG 
GTGGGACCAGGGCCAGCAGGGAATGTCAGGGCCACCCCTGACCTTCACTGTGACTCTGCTGCAGAGGGTG 
GCCTGGAGGAGATGGTGGAGGAGCTGAACAGCGGGAAGGTGATGTACGCCTTCTGCAGAGTGAAGGACCC 
CAACTCTGGACTGCCCAAATTTGTCCTCATCAACTGGAGA.GGCGAGGGCGTGAACGATGTGCGGAAGGGA 
GCCTGTGCCAGCC^CGTC^GCACCATGGCCAGCTTCCTGAAGGGGGCCaVTGTGACCATCAACGCACGGG 
CCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGCC^GGCTTCAGGTGCCAACTACAGCTT 
TCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGGGCTCTGTGTACCAGAAGACC 
AATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTTCTGGGCCAAAGCAGAGGTGAGTGCTGCCC 
CGGGGCATGCTGGGCACGTGGGAGTGTTCTGCTTGCTGTGGCTCATCTTTCCTCACAAGTGAGCTCATGC 
AGCATCCACTCTCCTTGGTGCCCATTACAGATGGTCACACTGAGGCTCGGGTAAGTTAAGCCACAAGGCT 
AATGATCGACTGGCTCTGGTGCCCGTCTTTGGCCATGTGCCTAAAACTC^GTCCTGGGCAGGGGATTAGG 
CTGAAGTGGCAGCATAGGGCTGAGCGGGCAGTGGCTCTCCCTGCAGAAGGAGGAGGAGAACCGTCGGCTG 
-GAGGAAAAGCGGCGGGCCGAGGAGGCACAGCGGCAGCTGGAGCAGGAGCGCCGGGAGCGTGAGCTGCGTG 
AGGCTGCACGCCGGGAGC^VGCGCTATCAGGAGCAGGGTGGCGAGGCCAGCCCCCAGAGGACGTGGGAGCA 
GCAGCAAGAAGTGGTTTCAAGGAACCGAAATGAGCAGGAGTCTGCCGTGCACCCGAGGGAGATTTTCAAG 
CAGAAGGAGAGGGCCATGTCC^CCACCTCCATCTCCAGTCCTCAGCCTGGCTIAGCTGAGGAGCCCCTTCC 
TGCAGAAGCAGCTCACCCAACCAGAGACCCACTTTGGCAGAGAGCCAGCTGCTGCCATCTCAAGGCCCAG 
GGCAGATCTCCCTGCTGAGGAGCCGGCGCCGAGCACTCCTCCATGTCTGGTGC^GGCAGA 
GTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGC^GCCCCCACTGGTGCAGCAGCAAGGTGCTG 
GCTCTGAGCACATTGACCACCACATTCAGGGCCAGGGGCTCAGTGGGCAAGGGCTCTGTGCCCGTGCCCT 
GTACGACTACCAGGCAGCCGACGACACAGAGATCTCCTTTGACCCCGAGAACCTCATCACGGGCATCGAG 
GTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCATGTTCCCTGCCAACTACG 
TGGAGCTCATTGAGTGAGGCTGAGGGCACATCTTGCCCTTCCCCTCTCAGACATC 

GAAGAGGAGGCCTGGGAGTTGACATTCAGCACTCTTCCAGGAATAGGACCCCCAGTGAGGATGAGGCCTC 
AGGGCTCCCTCCGGCTTGGCAGACTCAGCCTGTCACCCCAAATGCAGCAATGGCCTGGTGATTCCCACAC 
ATCCTTCCTGCATCCCCCGACCCTCCCAGACAGCTTGGCT 

CTGCCTGTGGCCAAGCCCTGAGTGGCCACTGCC^GCTGCGGGGAAGGGTCCTGAGCAGGGGCA 

AGGCTCTGGCTGCCITCTGCATTTATTTGCCTTTTTTCTTTTTOT 

CACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAGAATT^^ 

CAGAGTGGATCATGGTGGTTTGGCAGCAGGGAATTTGTCTTGTTGGAGCCTGCTCTGTGCTCCCCACTCC 

ATTTCTCTGTCCCTCTGCCTGGGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATT 

CAAAAACGGC^GACACAACATGTTCCTCCACGCGGCTCACTCGATGCCTGCAGGCCCCAGTGTGTGCCTC 

AACTGATTCTGACTTCTVGGAAAAGTAACAGAGAGTGGCAAA^^ 

AAAAAAAAAAA 

Human HIP-55 protein sequence - varl (public gi: 21619493) (SEQ ID NO: 390) 

MAANLSRNGPALQEAYVRVVTEKS PTDWALFTYEGN^ 

DPNSGLPKFVLINWTGEGVITOTOKGACASHVSTMASFLKGAHVTINARAEEDVEPECIMECT 

S FHKESGRFQDVGPQAPVGSVYQKTNAVS E I KRVGKDS FWAKAEKEEENRRLEEKRRAEEAQRQIiEQERR 

ERELREAARREQRYQEQGGEASPQRTWEQQQEWSRNRNEQESAVHPREIFKQKERAMSTTS ISSPQPGK 

LRSPFLQKQLTQPETHPGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPPEQETFYEQPPriV 

CXJQGAGSEHIDHHIQGQGIiSGQGLCARALYDYQAADDTEISFDPENIjITGIEVIDEGWWRGYGPDGHFGM 

FPANYVELIE 

Human HIP-55 protein sequence - var2 (public gi: 15079723) (SEQ ID NO: 391) 

MAANLSRNGPALQEAYVRWTEKSPTDWALFTYEGNS 
DPNSGLPKFVLINWTGEGVNDVRKGACASHVSTMASFL^^ 
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S FHKES GRFQDVGPQAPVGS VYQKTNAVS E I KRVGKDS FWAKAE KEEENRRLEEKRRAEEAQRQLEQERR 
ERELREAARREQRYQEQGGEASPQSRTWEQQQEVVSRNRNEQESAVHPREIFKQKERAMSTTSI^ 
KLRSPFLQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPPEQETFYEQPPL 
VQQQGAGSEHI DHHIQGQGLSGQGIjCARAIiYDYQAADDTEI S FDPENIj I TGIEVIDEGWWRGYGPDGHFG 
MFPANYVELIE 

Human HIP-55 protein sequence - var3 (public gi: 14041996) (SEQ ID NO: 392) 

MAANLSRNGPALQEAYVRVVTEKSPTDWALFTYEGNSN^ 
DPNSGLPKSVLIIWTGEGVNDVRKGACASHV^ 

S FHKESGRFQDVGPQAPVGSVYQKTNAVSE I KRVGKDS FWAKAEKEEENRRLEEKRRAEEAQRQIjEQERR 
ERELREAARREQRYQEQGGEAS PQSRTWEQQQEWSRNRNEQGSTCASLQESAVHPRE I FKQKERAMSTT 
SISSPQPGKI^SPFLQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPPEQE 
TFYEQP PLVQQQGAGSEHIDHHI QGQGLSGQGLCARALYDYQAADDTE I S FDPENLITGI EVIDEGWWRG 
YGPDGHFGMFPANYVELIE 

Human HIP-55 protein sequence - vax4 (public gi: 10441970) (SEQ ID NO: 393) 

MEKVAKASGANYS FHKESGRFQDVGPQAPVGSVYQKTNAVSE I KRVGKDS FWAKAE KEEENRRLEEKRRA 
EEAQRQLEQERRERELREAARREQRYQEQGGEAS PQRTWEQQQEWSRNRNEQESAVHPREI FKQKERAM 
STTSISSPQPGKLRSPFLQKQLTQPETHFGREPAAAI SRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPP 
EQETFYEQPPLVQQQGAGSEHIDHHIQGQGLSGQGLCARALYDYQAADDTEISFDPENLITGIEVIDEGW 
WRGYGPDGHFGMFPANYVELIE 

Human HIP-55 protein sequence - var5 (public gi: 10121215) (SEQ ID NO: 394) 

MAANLSRNGPALQEAYVRWTEKS PTDWALFTYEC3JSNDI RVAGTGEGGLEEMVEELNSGKVMYAFCRVK 
DPNSGLPKFVL INWTGEGVNDVRKGACS SHVSTMAS FLKGAHVTINARAEEDVEPECIME KVAKASGANY 
• S FHKESGRFQDVGPQAPVGSVYQKTNAVSE I KRVGKDS FWAKAE KEEENRRLEEKRRAEEAQRQLEQERR 
ERELREAARREQRYQEQGGEASPQSTWEQQQEWSRNRNEQESAVHPREIFKQKERAMSTTSISSPQPGK 
LRSPFLQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPPEQETFYEQPPLV 
QQQGAGSEH I DHH I QGQGIiSGQGLCARALYDYQAADDTE I S FDPENIiI TG I EVIDEGWWRG YGPDGHFGM 
FPANYVELIE 
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